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THE SEVERE ACUTE RESPIRATORY SYNDROME CORONA VIRUS 

All documents cited herein are incorporated by reference in their entirety. 

RELATED APPLICATIONS, FROM WHICH PRIORITY IS CLAIMED 

This application incorporates by reference in its entirety US provisional patent application 
60/462,218, Attorney Reference No. PP2G474.001, filed on April 10, 2003 via Express Mail with 
the US post office, US provisional patent application 60/462,465, Attorney Reference No. 
PP20480.001, filed on April 11, 2003 via Express Mail with the US post office, US provisional 
patent application 60/462,418, Attorney Reference No. PP20480.002, filed on April 12, 2003 via 
Express Mail with the US post office, US provisional patent application 60/462,748, Attorney 
Reference No. PP20480.003, filed on April 13, 2003 via Express Mail with the US post office, 
US provisional patent application 60/463,109, Attorney Reference No. PP20480.004, filed on 
April 14, 2003 via Express Mail with the US post office, US provisional patent application 
60/463,460, Attorney Reference No. PP20480.005, filed on April 15, 2003 via Express Mail with 
the US post office, US provisional patent application 60/463,668, Attorney Reference No. 
PP20480.006, filed on April 16, 2003 via Express Mail with the US post office, US provisional 
patent application 60/463,983, Attorney Reference No. PP20480.007, filed on April 17, 2003 via 
Express Mail with the US post office, US provisional patent application 60/463,971, Attorney 
Reference No. PP20480.008, filed on April 18, 2003 via Express Mail with the US post office, 
US provisional patent application 60/464,899, Attorney Reference No. PP20480.009, filed on 
April 22, 2003 via Express Mail with the US post office, US provisional patent application 
60/464,838, Attorney Reference No. PP20507.001, filed on April 22, 2003 via Express Mail with 
the US post office, US provisional patent application 60/465,273, Attorney Reference No. 
PP20518.001, filed on April 23, 2003 via Express Mail with the US post office, US provisional 
patent application 60/465,535, Attorney Reference No. PP20518.002, filed on April 24, 2003 via 
Express Mail with the US post office, US provisional patent application 60/468,312, Attorney 
Reference No. PP20480.010, filed on May 5, 2003 via Express Mail with the US post office, and 
US provisional patent application 60/473,144, Attorney Reference No. PP20480.011, filed on 
May 22, 2003, US provisional patent application 60/495,024, Attorney Reference No. 
PP20480.012, filed on August 14, 2003 via Express Mail with the US post office, US provisional 
patent application 60/505,652, Attorney Reference No. PP20480.013, filed on September 23, 
2003 via Express Mail with the US post office, US provisional patent application 60/510,781, 
Attorney Reference No. PP20480.014, filed on October 1 1, 2003 via Express Mail with the US 
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post office, US provisional patent application 60/529,464, Attorney Reference No. PP2G480.015, 
filed on December 1 1 , 2003 via Express Mail with the US post office, US provisional patent 
application 60/536,177, Attorney Reference No. PP20480.016, filed on January 12, 2004 via 

Express Mail with the US post office, and US provisional patent application 60/ — , , 

5 Attorney Reference No. PP20480.017, filed on April 7, 2004 via Express Mail with the US post 
office. 

FIELD OF THE INVENTION 

The invention relates to nucleic acids and proteins from Severe Acute Respiratory 
Syndrome (SARS) Virus. These nucleic acids and proteins can be used in the preparation and 

10 manufacture of vaccine formulations for the treatment or prevention of SARS. The invention 
also relates to diagnostic reagents, kits (comprising such reagents) and methods which can be 
used to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 
invention also relates to methods for the treatment or prevention of SARS utilizing small 
molecule viral inhibitors and combinations of small molecule viral inhibitors and kits for the 

15 treament of SARS. 

BACKGROUND OF THE INVENTION 

An outbreak of a virulent respiratory virus, now known as Severe Acute Respiratory 
Syndrome (SARS), was identified in Hong Kong, China and a number of other countries around 
the world in 2003. Patients typically had symptoms including fever, dry cough, dyspnea, 
20 headache, and hypoxemia. Isolates of the SARS virus appear to have homology with at least the 
RNA polymerase gene of several known coronaviruses. A phylogenetic analysis of this 
homology is presented in Peiris et al. , "Coronavirus as a possible cause of severe acute 
respiratory syndrome*', Lancet, published online April 8, 2003 at 

http:/rmageAhe]anceLcom/extm incorporated herein by reference in its entirety. 

25 Other sequenced fragments of the SARS virus genome appear to overlap with the open reading 
frame lb of coronaviruses. See, Drosten et al, "Identification of a Novel Coronavirus in 
Patients with Severe Acute Respiratory Syndrome", New England Journal of Medicine, 
published online at http://mvw.nejm.org on April 10, 2003, incorporated herein by reference in its 
entirety. 

30 The Genome Science Center in British Colombia, Canada published on its website 

(http://www.bcgsc.ai/bioW a draft genome assembly of 29,736 base pairs of a virus believed 
to be a SARS virus, referred to as the TOR2 isolate. This draft genome assembly is given herein 
asSEQIDNO: 1. 

The Centers for Disease Control (CDC) published a nucleotide sequence of a SARS-CoV 
35 . strain (SEQ ID NO: 2) on its website Qtttp://www.cdc.gov/ncidod/ The CDC 
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has also published a phylogenetic tree of the predicted N, S and M proteins (attached as FIGURE 
6). This tree places die SARS vims outside any of the previously known coronavirus groups. 

There is a growing need for prophylactic or therapeutic vaccines against the SARS virus as 
well as diagnostic and screening methods and compositions to identify the presence of the virus 
5 in, e.g. , mammalian tissue or serum. 

SUMMARY OF THE INVENTION 

The invention relates to nucleic acids and proteins from Severe Acute Respiratory 
Syndrome (SARS) virus. These nucleic acids and proteins can be used in the preparation and 
manufacture of vaccine formulations for the treatment or prevention of SARS. Such vaccine 

10 formulations may include an inactivated (or killed) SARS vims, an attenuated SARS virus, a 
split SARS virus preparation and a recombinant or purified subunit formulation of one or more 
SARS viral antigens. Expression and delivery of the polynucleotides of the invention may be 
facilitated via viral vectors and/or viral particles. 

The invention also relates to diagnostic reagents, kits (comprising such reagents) and 

15 methods which can be used to diagnose or identify the presence or absence of a SARS virus in a 
biological sample. The invention further includes non-coding SARS viral polynucleotide 
sequences, SARS viral sequences encoding for non-immunogenic proteins, conserved and 
variant SARS viral polynucleotide sequences for use in such diagnostic compositions and 
methods. 

20 The invention further relates to vaccine formulations comprising one or more SARS virus 

antigens and one or more other respiratory vims antigens. Additional respiratory virus antigens 
suitable for use in the invention include antigens from influenza vims, human rhinovirus (HRV), 
parainfluenza vims (PIV), respiratory syncytial virus (RSV), adenovirus, metapneumovirus, and 
rhino vims. The additional respiratory virus antigen could also be from a coronavirus other than 

25 the SARS coronavirus. Preferably, the additional respiratory virus antigen is an influenza viral 
antigen. 

The compositions of the invention may further comprise one or more adjuvants. Adjuvants 
suitable for use in the invention include mucosal, transdermal or parenteral adjuvants. Mucosal 
adjuvants suitable for use in the invention include detoxified bacterial ADP-ribosylating toxins, 
30 such as E. coli heat labile toxoids (e.g„ LTK63), chitosan and derivatives thereof, and non-toxic 
double mutant forms of Bordetella pertussis toxoids. Parenteral adjuvants suitable for use in the 
invention include MF59 and aluminum or aluminum salts. 

The invention also provides methods for treating SARS by administering small molecule 
compounds, as well as methods of identifying potent small molecules for the treatment of SARS. 
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In one aspect of the invention a method of identifying a therapeutically active agent is 
provided comprising: (a) contacting the therapeutically active agent with a cell infected with the 
SARS virus; (b) measuring attenuation of a SARS related enzyme. 

In a more particular embodiment, the therapeutically active agent is a small molecule. In 

5 another more particular embodiment, the therapeutically active agent is a nucleoside analog. In 
another more particular embodiment the therapeutically active agent is a peptoid, oligopeptide, 
or polypeptide. Li another embodiment the SARS related enzyme is SARS protease. In another 
embodiment the SARS related enzyme is SARS polymerase. In still another embodiment the 
SARS related enzyme is a kinase. Methods of identifying therapeutically active agents for 

10 treatment of SARS virus infection are further discussed in Section V below. 

In another aspect of the invention a method of treating a human infected with SARS is 
provided comprising administering a small molecule to a patient in need thereof. In one 
embodiment the small molecule is an inhibitor of SARS protease. In another embodiment the 
small molecule is an inhibitor of SARS polymerase. In another embodiment the SARS related 

15 enzyme is a kinase. In still another embodiment the small molecule is administered orally or 
parenterally. 

The invention also provides the use of such small molecules in the manufacture of a 
medicament for the treatment of severe acute respiratory syndrome. 

Small molecule compounds of the present invention include those of less than 1000 g/mol, 

20 preferably with an aromatic region and greater than one heteroatom selected from O, S, or N. 
Preferred small molecules include, but are not limited to acyclovir, gancyclovir, vidarabidine, 
foscamet, cidofovir, amantidine, ribavirin, trifluorothymidine, zidovudine, didanosine, 
zalcitabine, and combinations thereof. Interferons may also be used for treating patients, 
including interferon-a and interferon-p. Interferon treatment has shown promise in treating 

25 SARS in monkeys (Enserink (2004) Science 303: 1273-1275), particularly when pegylated 
(Haagmans et aL (2004) Nature Medicine 10:290-293). 

One aspect of the present invention relates to methods for identifying individuals exposed 
to, and biological samples containing SARS virus (S ARSV), and to kits for carrying out the 
methods. Such methods can utilize nucleic acid detection techniques such as PCR, RT-PCR (the 

30 Coronaviridae are RNA viruses), transcription-mediated amplification (IMA), ligase chain 
reaction (LCR), branched DNA signal amplification assays, isothermal nucleic acid sequence 
based amplification (NASB A), other self-sustained sequence replication assays, boomerang 
DNA amplification, strand-displacement activation, cycling probe technology, or combinations 
of such amplification methods. Such nucleic acid detection techniques utilize oligonucleotides 

35 having nucleotide sequence similar to, or complementary to, the SARS viral genome, as primers 
(e.g., for amplification) and as probes (e.g., for capture or detection), as is well known in the art. 
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Alternatively, or in addition to the nucleic acid detection methods described supra, the 
methods of the present invention can utilize various immunoassay techniques for detection of 
SARS V antigens and/or antibodies. 

Accordingly, the present invention relates to methods of identifying individuals exposed to 

5 SARSV, or biological samples containing SARSV, by detecting the presence of SARSV 
antigens using antibodies which specifically bind to the same. Hie antibodies are preferably 
monoclonal antibodies. Quantification of the amount of viral antigens present in a sample of an 
individual may be used in determining the prognosis of an infected individual. Preferably, the 
SARSV antigens to be detected are generally one of the structural proteins, particularly those 

10 present on the surface of the viral particles and include, for example, the spike glycoprotein (S), 
also called E2; the envelope (small membrane) protein (E), also called sM; the membrane 
glycoprotein (M), also called El; the hemagglutinin-esterase glycoprotein (HE); also called E3; 
and the nucleocapsid phosphoprotein (N). In preferred embodiments, the antigens to be detected 
are the S, E and M proteins using antibodies to the same. 

15 The present invention relates to kits for identifying individual SARSV and reagents used in 

such kits. The kits comprise a first container which contains antibodies which specifically bind to 
a SARSV antigen and a second container which contains the SARSV antigen. The antibodies are 
preferably monoclonal antibodies. The kits may be adapted for quantifying the amount of 
antigen in a sample of an individual. Such information may be used in determining the prognosis 

20 of an infected individual. 

The present invention relates to methods of identifying individuals exposed to SARS virus, 
or biological samples containing SARSV, by detecting the presence of antibodies against SARS 
virus antigen in a sample using SARS antigen. Quantification of the amount of anti-S ARS 
protein from SARS antibodies present in a sample of an individual may be used in determining 

25 the prognosis of an infected individual. Any one or more of the viral proteins (structural proteins 
or nonstructural proteins) may be used as antigen to detect the SARSV antibodies; preferahly a 
SARSV antigen that is conserved amoung SARSV isolates is preferred. In this regard, 
nonstructural protein (e.g. 9 Pol, Hel, 3CLp, MP, PLP1, PLP2) may be particularly useful. 
The present invention relates to kits for identifying individuals exposed to SARS and 

30 reagents used therein. The kits comprise a first container which contains antibodies which were 
produced in response to exposure to an antigen from SARS virus and a second container which 
contains the SARS antigen(s). The kits may be adapted for quantifying the amount of anti-S ARS 
antibodies present in a sample of an individual. Such information may be used in determining the 
prognosis of an infected individual. 

35 Hie present invention relates to methods of identifying individuals exposed to SARS virus, 

or biological samples containing SARSV, by detecting the presence of nucleic acid from SARS 
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vims. Quantification of the amount of SARS nucleic arid present in a sample of an individual 
may be used in determining the prognosis of an infected individual. Hie methods utilize 
oligonucleotide probes and/or primers that are similar or complementary in sequence to the 
SARSV genome or transcription or replication products. Preferred probes and primers are 

5 described herein. Also included in the present invention are kits for carrying out the methods of 
detecting die SARSV nucleic acid. 

Hie invention further includes a method for the treatment and/or prevention of SARS 
through the administration of a therapeutically effective amount of at least one antiviral 
compound from among those described in the US Patents and published international patent 

10 applications listed in Table 1 and Table 2. In one embodiment of the method, the antiviral 
compound is a small molecule. In another embodiment, the antiviral compound is a protease 
inhibitor. In a further embodiment, the antiviral protease inhibitor is a 3C-like protease inhibitor 
and/or a papain-like protease inhibitor. In another embodiment, the antiviral compound is an 
inhibitor of an RNA-dependent RNA polymerase. In another embodiment, a first antiviral 

15 compound which is a protease inhibitor is administered with a second antiviral compound which 
is an RNA-dependent RNA polymerase inhibitor. The invention further provides for the 
administration of a steroidal anti-inflammatory drug in combination with at least one antiviral 
compound, for example, from the antiviral compounds described in the documents listed in 
Table land Table 2. 

20 The invention further provides for a method for the treatment and/or prevention of SARS 

through the administration of a therapeutically effective amount of at least one antiviral 
compound from among those described in the US Patents and published international patent 
applications listed in Table 1 and Table 2 by inhalation. In one embodiment of the method, the 
antiviral compound is a small molecule. In another embodiment, the antiviral compound is a 

25 protease inhibitor. In a farther embodiment, the antiviral protease inhibitor is a 3C-like protease 
inhibitor and/or a papain-like protease inhibitor. In another embodiment, the antiviral compound 
is an inhibitor of an RNA dependent RNA polymerase. In another embodiment, a first antiviral 
compound which is a protease inhibitor is administered with a second antiviral compound which 
is an RNA-dependent RNA polymerase inhibitor. The invention further provides for the 

30 administration of a steroidal anti-inflammatory drug in combination with at least one antiviral 
compound, for example, from the antiviral compounds described in the documents listed in 
Table 1 and Table 2 by inhalation. The steroidal anti-inflammatory drug may be administered by 
inhalation for a local effect or administered for systemic absorption such as via an oral or 
intravenous route. 

35 The invention further provides the use of an antiviral compound, as defined above, in the 

manufacture of a medicament for the treatment of severe acute respiratory syndrome. 
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Hie invention further provides for a kit for use by a consumer for the treatment and/or 
prevention of SARS. Such a kit comprises: (a) a pharmaceutical composition comprising a 
therapeutically effective amount of at least one antiviral compound from among those described 
in the US Patents and published international patent applications listed in Table 1 and Table 2 
5 and a pharmaceutical^ acceptable carrier, vehicle or diluent; (b) a container for holding the 
pharmaceutical composition; and, optionally; (c) instructions describing a method of using the 
pharmaceutical compositions for the treatment and or the prevention of SARS. The kit may 
optionally contain a plurality of antiviral compounds for the treatment of SARS wherein the anti 
viral compounds are selected from 3C-like protease inhibitors and papain-like protease 

10 inhibitors. In a further embodiment, the kit contains an antiviral compound which is an RNA- 
dependent RNA polymerase inhibitor. When the kit comprises more than one antiviral 
compound, the antiviral compounds contained in the kit may be optionally combined in the same 
pharmaceutical composition. 

An additional aspect of the invention provides for the use of at least one of the antiviral 

IS compounds described in the US Patents and published international patent applications listed in 
Table 1 and Table 2 for the manufacture of a medicament for the treatment or prevention of 
SARS. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1: Schematic of coronavirus genome organization. 

20 FIGURE 2: Schematic of coronavirus ORFla/ORFlb gene products. 

FIGURE 3 (A - C): Alignment of coronavirus polynucleotide sequences for selected genes 
(including nucleocapsid (N), matrix (M), and hemagluttinin-esterase (HE)). 

FIGURE 4 (A - F): Alignment of coronavirus polypeptide sequences (including ORFla/ORFlb, 
nucleocapsid (NP), hemagluttinin-esterase (HE), envelope (Sm or E), matrix (M), and spike (S). 

25 FIGURE 5: Alignment of spike (S) polypeptide sequences, taken from Figure 4, in the region of 
the junction of the SI and the S2 domains, and protease cleavage site for selected coronaviruses. 

FIGURE 6: CDC phylogenetic tree of SARS-CoV strain (Clustalx 1.82, neighbor-joining tree). 
Figure 6A shows coronavirus N protein analysis, Figure 6B shows coronavirus S protein 
analysis, and Figure 6C shows coronavirus M protein analysis. 

30 FIGURE 7: Conserved and specific sequence of the SARS virus. Figures 7A-7D show multiple 
sequence alignments (CLUSTAL W 1.82) of the structural proteins of the SARS virus genome 
(7 A: PEP4 Spike protein; 7B: PEP7 small membrane protein; 7C: PEP8 matrix glycoprotein; 7D: 
FEP13 nucleocapsid protein), which have counterparts in all or some of the other known 
coronaviruses. Figures 7E-7H show dendrograms reporting the protein distances among the 
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sequences in alignments 7A-7D. Labels 229E: human coronavinis; MEV: murine hepatitis virus; 
TGV: transmissible gastroenteritis virus; AIBV: avian infectious bronchitis virus; BOVINE: 
Bovine coronavinis; PEDV: porcine epidemic diarrhea virus. 

FIGURE 8: Alignment of the 5TJTR of several coronaviruses, to show consensus nucleotide 
5 sequence at the 5TJTR. 

FIGURE 9: Sequences of preferred primers for amplification of the 5TJTR. F and R denote 
forward and reverse PCR primers, and the numbers indicate nucleotide positions withing Figure 
8. 

FIGURE 10: Alignment of the 3TJTR of several coronaviruses, to show consensus nucleotide 
10 sequence at the 3TJTR. 

FIGURE 1 1 : Sequences of preferred primers for amplification of the 3UTR. F and R denote 
forward and reverse PCR primers, and numbers indicate nucleotide positions within Figure 10. 

FIGURE 12: Coiled-coil prediction for SEQ ID NO: 6042, using Coils program (Figure 12A) or 
LeanCoil (Figure 12B). 

15 FIGURE 13: Example of insertion of a reporter gene-of-interest at a site between exisiting SARS t 
virus genes. Small nonstructural gene products are not depicted schematically. 

FIGURE 14: Schematic depicting representative examples of SARS virus replicons. Small 
nonstructural gene products are not depicted schematically. 

FIGURE 15: SARS virus nsp2 proteinase (3CLp) and identification of catalytic and substrate 
20 sites. 

FIGURE 16: alignment of SARS virus nsp2 proteinase (3CLp) with that of avian IB V, MHV, 
and BCoV. Residues in dotted boxes are key residues the substrate sites (F, Y & H); residues in 
solid boxes are catalytic cysteine (C) and histidine (H) residues. 

FIGURE 17: Genome organization of SARS coronavinis. Replicase and structural regions are 
25 shown, along with the predicted products of cleavage within ORFla and ORFlb. The position of 
the 5* RNA leader sequence (L), the 3' poly(A) tract and the ribosomal frame-shift consensus 
between ORFla and ORFlb are also indicated. Each box represent a protein product. They are, 
shaded according to the level of amino acid identity with corresponding proteins of other 
coronaviruses (see also Table 2). The SARS-specific genes are white. Positions of the 9 SARS- 
30 specific six-base IG sequences (5'-ACGAAC-3 f ; SEQ ID NO 7293) are indicated by arrows. 

FIGURE 18: Genome organization of Coronaviruses representative of group 1 (HCoV-229E, 
accession number AF304460), group 2 (mouse hepatitis virus MHV, accession number. 
NCJ)01846), group 3 (avian infectious bronchitis virus AIBV, accession number NC_001451) 
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and SARS coronaviius. Other completely sequenced coronaviiuses used in this study are 
available at the following accession numbers: porcine epidemic diarrhea virus (PEDV), 
AF353511; transmissible gastroenteritis virus (TGV), NCJ0O23O6; Bovine coronavirus (BCoV): 
AF220295. Red boxes represent group-specific genes. The position of the leader RNA sequence 
5 and poly(A) tract is also indicated in genomes where they are reported. The position of specific 
IG sequences is indicated by circles of different shades. In the SARS genome^ we also find three 
IG sequences specific for group 2 coronavirus. . 

FIGURE 19: Topological model predicted for the spike protein anchored to the viral membrane. 
Structural and predicted functional domains are indicated. The N-terminal region (SI) is 
0 predicted to contain the receptor binding domain. Two coiled coil regions within the S2 domain, 
partially superimposed to leucine zipper motifs are presumably involved in oligomerization. The 
hydrophobic domain is responsible for membrane anchoring. 

FIGURE 20: Phylogenetic tree obtained from the multiple sequence alignment of a 922 bp 
internal region of the pol gene from 12 coronaviruses and SARS. Numbers at the nodes represent 

15 the result of a bootstrap analysis and strongly support the branches. Sequences not available 
within the complete coronavirus genomes have been retrieved from GenBank at the following 
accession numbers: nonagglutinating encephalomyelitis virus of swine (PHEV), AF124988, 
Human OC43 virus (OC43), AF124989, canine coronavirus (CCV), AF124986, feline infectious 
peritonitis virus (FIPV), AF124987, turkey coronavirus (TCV), AF124991, syaloacryoadenitis 

20 virus of rats (SDAV), AF124990. 

FIGURE 21: 21A . Unrooted tree obtained from the alignment of consensus sequences of the 
group I and group H SI domain of spike proteins (Gl_cons and G2_cons) with those of a group 
3 spike (AIBV) and the spike of SARS virus. The number indicates the result of a bootstrap 
analysis. The sequences used to generate the consensus profile from group 1 are: HcoV-229E, 

25 accession number P15423; porcine epidemic diarrhea virus (PEDV), acc no: NP_598310; 

transmissible gastroenteritis virus (TGV), acc no: NP_058424; Canine coronavirus (CCV), acc 
no: S41453; porcine respiratory virus (PRV), acc no: S24284; feline infectious peritonitis virus 
(FIPV), acc no: VGIH79. The sequences used to generate the consensus profile from group 2 
are: mouse hepatitis virus (MHV), acc no: NP_045300; Bovine coronavirus (BCoV), acc no: 

30 NP_150077; Human coronavirus OC43, acc no: P36334; hemagglutinating encephalomyelitis 
virus of swine (PHEV), acc no: AAL80031; for group 3, only the sequence of the spike protein 
of avian infectious bronchitis virus (AIBV), acc no: AA034396 was used. 21B.: Schematic 
representation of cysteine positions in SI domains of group 1, 2 and 3, compared to the SARS 
spike. Horizontal bars represent the SI amino acid sequences (in the case of SARS and AIBV) or 

35 the consensus profiles (generated from group 1, Gl_cons, and from group 2, G2_cons). The 
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length of the bars are not to scale. Relative cysteine positions are indicated by rectangle bars. 
Only cysteines perfectly conserved within each consensus are reported. lines connect cysteines 
conserved between the SARS S 1 domain and the consensus sequences as shown. 

FIGURE 22: illustration of a Neisseria Adhesin A protein (NadA). 

5 FIGURE 23: Raw translation from SARS coronavirus genome (reading frame +1). 

FIGURE 24: Raw translation from SARS coronavirus genome (reading frame +3) 

FIGURE 25: lb and Spike open reading frames, separated by *. 

FIGURE 26: SARS growth in vero cells. 

FIGURE 27: chromatogram of the capture step of SARS coronavirus on Matrix CeDufine 
10 Sulfate Superformance 150/10. Analysis was on 100ml coronoavirus harvest. Hie left Y axis 
shows absorbance at 280nm. The right Y axis shows the gradient (%B). Hie X axis shows the 
volume (ml). 

FIGURE 28: Silver-stained MCS chromatography fractions. Lanes are: (1) marker, 

(2) coronavirus vero cell harvest; (3) coronavirus vero cell harvest, after 0.65jim filtration; 
15 (4) flowthrough; (5) wash; (6) 20% peak (virus peak). Lanes were loaded with 1 jig of test 

protein. 

FIGURE 29: Western Blot of MCS chromatography fractions. Lanes are as described for Fig.28. 

FIGURE 30: linear density gradient ultracentrifugation, 15-60% sucrose (SW28, 2 hours, 
20000 rpm). The graph shows protein concentration (■) and sucrose concentration (♦). 

10 FIGURE 31: Silver-stained density gradient fractions on NuPage 4-12% Bis-Tris-Ge (Novex), 
reduced conditions, heated for 10 minutes at 70°C. Lanes are: (1) marker, (2) 20% peak MCS; 

(3) density gradient fraction 11; (4) density gradient fraction 12; (5) density gradient fraction 13; 
(6) density gradient fraction 14; (7) density gradient fraction 15; (8) density gradient fraction 16; 
(9) density gradient fraction 17. Hie bulk of proteins was in fractions 15 to 17. Lanes 2, 8 and 9 

15 were loaded with 1/ig protein. 

FIGURE 32: Chromatogram of the Capture Step of SARS coronavirus on MCS. Details are as 
for Figure 27, except that 200ml harvest was used. 

FIGURE 33: Silverstain (left) and Western Blot (right) of chromatographic fractions. Lanes are 
as described for Figures 28 and 29, except that lane (6) is the 5% peak. Treatment before 
\0 SDS-PAGE was at room temperature for 30 minutes. 

FIGURE 34: Density Gradient Ultracentrifugation, 15-40% sucrose (SW28, 2 hours, 20000 
rpm). The graph shows protein concentration (■) and sucrose concentration (♦). 
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FIGURE 35: Silverstain (left) and Weston Blot (right) of Density Gradient Ultracentrifugation 
fractions on NuPage 4-12% Bis-Tris-Ge (Novex), reduced conditions. Lanes are: (1) marker; (2) 
density gradient fraction 6; (3) density gradient fraction 7; (4) density gradient fraction 8; (5) 
density gradient fraction 9; (6) density gradient fraction 10; (7) density gradient fraction 15. 
5 Fractions 7-10 (lanes 3-6) contained pure coronavirus proteins. The bulk of impurities was in 
fraction 15 (lane 7). Lanes 2, 8 and 9 were loaded with ~l/ig protein. Treatment before 
SDS-PAGE was at room temperature for 30 minutes. 

FIGURE 36: EM pictures of Density Gradient Fractions 8-10. figure 36A shows fraction 8; 
figure 36B shows fraction 9; figure 36C shows fraction 10. 

10. FIGURE 37: Spike/NadA fusion constructs. 

FIGURES 38 and 39: Results of the expression in Kcoli of SIl, Sl L -NadA and S^-NadA**^. 
figure 38 shows SDS-PAGE analysis of total lysates from BL21(DE3)/pET, BL21(DE3)/pKr- 
S1 L and BL21(DE3)/pET-Sl L -NadAAanchor- The bands are indicated by an arrow, and the three 
lanes are,. from left to right BL21(DE3)/pET; BL21(DE3)/pET-Sl L ; BL21 (DE3ypET- - 

15 Sl L -NadAAanchor. Figure 39 shows (39A) SDS-PAGE and (39B) western blot analysed of total 
lysates from BL21(DE3)/pET, BL21(DE3)/pET-Sl L -NadA (grown under un-induced condition) 
and BL21(DE3)/pET-S l L -NadA (grown under induced condition). The bands are indicated by an 
anrow, and lanes are, from left to right BL21(DE3)/pET; BL21(DE3)/pET-Sl L -NadA; 
BL21(DE3)/pET-Sl L -NadA. The western blot shows the presence of oligomeric forms of the 

10 protein. 

FIGURE 40: Schematic of SARS Spike clones. 

FIGURE 41: Transient Expression of SARS Spike Proteins (western blot of COS7 cell lysate). 
Each lane of the 4-20% TG SDS gel was loaded with 20/xg cell lysate (total 1.2mg). Hie labeling 
antibodies are shown. 

\5 FIGURE 42: Western blot analyses of COS7 cell lysates on 4% TG SDS gel showing 
oligomerization state of intracellular S molecules. 

FIGURE 43: Western blot analyses of COS7 cell lysates on 4-20% TG SDS gel showing 
Transient Expression of SARS Spike Proteins. Lanes are: (1) mock, AF; (2) mock, DF; (3) nSh, 
AF; (4) nSh, DF; (5) nShATC, AF; (6) nShATC, DF. Each lane was loaded with 5/il of each 
0 sample, 400/il total. The blot was labeled with antibody against the EBs-tagged protein. 

FIGURE 44: Western blot analyses of COS7 cell medium on 4-20% TG SDS gel showing 
Transient Expression of SARS Spike Proteins. Truncated spike protein is secreted. Spike 
proteins were purified from the culture medium (from a 10cm plate), first by a ConA column and 
then finally by ESstag Magnetic beads. Each lane was loaded with one third of the material. 

-11- 



WO 2004/092360 PCT/US2004/011710 

FIGURE 45: Weston blot analyses of COS7 cell lysates on 4-20% TG SDS gel showing 
glycoslation of SARS spike proteins. In the two left-hand blots (lanes 1-5), samples were boiled 
in SDS and |}-mercaptoethanol; in the two right-hand blots (lanes 6-1 1), samples were in SDS 
only, with no boiling. Lanes 1-8 were labeled with a monoclonal raised against the His-tag 
5 protein; labes 9-1 1 were labeled with rabbit anti-S ARS antibody. 

FIGURE 46: Effect of SARS spike protein expression on cell viability. 

FIGURE 47: Weston blot analyses of COS7 cell lysates on 4% TG SDS gels showing 
oligomerization state of intracellular spike molecules. Blots were labeled with anti-His-tag mAb. 
Hie membrane fraction of COS7 cell lysate was fractionated by a sizing column before loading 
10 the lanes. Fractions 7 to 14 show bands with kDa values of: 71000, 1400, 898, 572, 365,232, 148 
and 99, respectively. 

FIGURE 48: Fractionation of cells into aqueous and detergent fractions. 

FIGURE 49: Schematic of constructs for use in OMV preparation. 

FIGURE 50: SARS HR1 and HR2 constructs. 

15 FIGURE 5 1 : Vaccine protection froms SARS in Balb/c mouse model. 

FIGURE 52: Expressed on Spike protein in transfected 293 cell lysates (52A) or COS7 cell 
culture supernatants (52B). Proteins were separated on 4-20% TG SDS gels. The label was anti- 
His-tag, except for the right-hand three lanes of 52B, where the label was rabbit anti-S ARS 
serum, hi Figure 52A, the left-hand three lanes were treated with DTT and were boiled, but 
20 neither treatment was used for the right-hand three lanes. In Figure 52B, no DTT was used, but 
all lanes were heated to 80°C for 5 minutes. 

FIGURE 53: Western blot of Spike proteins expressed in COS7 cells. Proteins were incubated at 
room temperature (RT), 80°C or 100°C to check for any effect on molecular weight FIGURE 54 
shows similar experiments on SARS virions. 

25 FIGURE 55: Results of a pulse chase experiment, showing expression and processing of SARS 
spike protein following infection with alphavirus replicon particles. Cells were treated with or 
without EndoH as shown. 

FIGURE 56: Effect of heating on Spike protein trimers. 

FIGURE 57: Coomassie blue-stained gel of yeast-expressed proteins. Lanes are: 1-See Blue 
30 Standard (10/il); 2-pAB24 gbl (20/ig); 3-SARS Spike SI c.l gbl (20jig); 4-SARS Spike SI c.2 
gbl (20/ig); 5-See Blue Standard (10/xl); 6-pAB24 ip (5/d); 7-SARS Spike SI c.l (5/xl); 8-SARS 
Spike SI c.2 (5/il). 

FIGURES 58 to 64: Schematics of preparation of yeast expression constructs. 
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FIGURES 65 to 66: Yeast-expressed sequences for Spike. 

FIGURE 67: Western blots showing expression of SARS spike protein from alphavirus replicon 
particles and replicon RNA. Figure 67 A was run under non-reducing conditions and at room 
temperature (le. no heating), with lanes: (1) VEE/SIN-spike infection; (2) VEE/SIN-GFP 
5 infection; (3) Replicon-spike RNA transfection; (4) Replicon-GFP RNA transfection. Figure 67B 
was run with SARS virions at different temperatures, as shown. 

FIGURE 68: induction of antibody responses in mice. Vaccine groups are: (1) Inactivated SARS 
Virus; (2) Truncated Recombinant Spike Protein; (3) Full length Spike: DNA+DNAJPLG+ 
Alphavirus; (4) Full length Spike: Alphavirus particles only. 

10 FIGURE 69: Binding of human monoclonal antibody S3.2 to purified truncated Spike protein. 
Hie X-axis shows antibody concentration, and the Y-axis shows ELISA absorbance. The 
interpolation result is 2158.13. 

FIGURE 70: Geometric mean ELLS A titers of antibodies induced by the SARS-CoV spike 
protein delivered as different vaccines (left to rigjht: inactivated virus; 3fig truncated spike 
15 protein; 75fig DNA encoding truncated spike protein. 

FIGURE 71 : Neutralization titers after immunization with (left) nSdATC protein or (right) DNA 
encoding nSdATC, delivered on PLG. 

FIGURE 72: Correlation between the spike antigen binding and neutralizing antibodies 

FIGURE 73: Western blot of CHO cell lines expressing Spike protein in full-length form (left) or 
20 in truncated form (right). Proteins were separated by 4-12% SDS-PAGE, with boiling in DTT 
and staining by polyclonal serum. 

FIGURE 74: Structural components of SARS-CoV spike glycoprotein and expression construct. 
L denotes leader peptide (residues 1-13), TM the transmembrane, and Cy the cytoplasmic tail 
segments. The hexa-His tags are not shown. 

25 FIGURE 75: Western blot analysis of SARS spike proteins expressed in COS7 cells. In Figure 
75 A, COS7 cells were transfected with indicated plasmid constructs and the expressed proteins 
in cell lysates 48 hr post-transfection were analysed by SDS-PAGE (4-20% polyacrylamide) in 
reducing and denaturing condtions, with proteins visualized by anti-histidine Mab. In Figure 
75B, proteins were collected from cell culture medium 48 hr post-transfection and purified first 

30 by a ConA column and then by KBs-tag magnetic beads. Purifed proteins were analysed by SDS- 
PAGE (4-20% polyacrylamide) and were visualized by anti-SARS rabbit serum. 
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FIGURE 76: Endo H sensitivity of C-terminal truncated spike protein (SA) found in cell lysate 
Ganes 1,2) and culture medium (lanes 3,4). Positions of internal SA protein and secreted SA 
protein are marked with arrow heads. 

FIGURE 77: Oligomeric status of the SARS spike protein. Recombinant S protein oligomer in 
5 COS7 cells transfected with the full-length spike construct (nSh). The cell lysates were treated 
with DTT and/or heat as indicated above each lane. Hie different forms of S protein in treated 
and untreated samples were visualized by SDS-PAGE (4% polyacrylamide) and Western blot 
analysis using anti-histidine MAb. 

FIGURE 78: Effect of heat denaturation on the oligomeric status of recombinant S protein in the 
10 absence of DTT. The COS7 cell lysates were heated before the electrophoresis as indicated and 
the S proteins were visualized as described fogiFigure 77. 

FIGURE 79: Effect of heat denaturation on die oligomeric status of spike protein in SARS virion 
particles. SARS-CoV were grown in Vero cells, purified and solubilized from the virion particles 
by SDS, heat-denatured as indicated and visualized as described in Hgure 77, except that rabbit 
15 antiserum against the purified virus was used as a probe. 

FIGURE 80: Analysis of the oligomeric status of SARS virion spike protein by cross-linking 
experiment. Solubilized SARS virion proteins were treated with DMS. Both untreated (-) and 
DMS treated (+) virion proteins were heat denatured in the absence of DTT and visualized by 
4% PAGE followed by silver staining. 

JO FIGURES 81 & 82: Analysis of the oligomeric status of truncated spike protein by heat 
denaturation. Truncated spike protein within COS7 cell lysates (81) or secreted into culture 
medium (82) were heat denatured as indicated in the absence of DTT and visualized by Western 
blot analysis. 

FIGURE 83: Reactivity of deglycosylated full-length spike oligomer with conformational and 
15 non-conformational antibody. The full-length recombinant spike oligomer was partially 

deglycosylated with PNGase F in non denaturating condition and visualized by Western blot 
analysis using anti-histidine Mab (lane 1,2,3) or rabbit antiserum against purified SARS CoV 
(lane 4,5,6). 

FIGURE 84: Localization of expressed SARS spike proteins in fractionated COS7 cell lysate 
0 visualized by western blot Cells were transfected with indicated plasmids and lysed with 

Dounce homogeniser in hypotonic buffer 48 hr post transfection. Cell lysate was centrifuged to 
obtain soluble cytosol and insoluble membrane fraction that was further solublized by 4% Triton 
X-100. Proteins were heated with SDS at 80 C and analysed by SDS-PAGE (4-20% 
polyacrylamide) in reducing condtion. Proteins were visualized by anti-histidine Mab. The 
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cytosol fractions were loaded in lanes 1, 3, and 5 and the membrane fractions were loaded in 
lanes 2, 4, and 6. 

FIGURE 85: Intracellular and surface expression of recombinant full-length (A J)) or truncated 
(BJE) spike protein in COS7 cells. The cells were fixed at 48 hrs posttransfection and either 
5 treated with detergent (Cytofix/perm, BD Biosciences) for intracellular immunofluorescence 
(A3,Q or with 2% paraformaldehyde for cell surface immunofluorescence observation (D3J0 
at x40 magnification. Mock transfected cells (CJF) were included as controls. 

FIGURES 86-105: SDS-PAGE od Exoli expressed proteins. Tot = total protein; Sol = soluble 
protein fraction. Labels are protein names (Tables 26-30). 

10 FIGURE 106: Immunofluorescence after administration of vector encoding optimsed N antigen. 

FIGURE 107: Immunofluorescence of (A) native and (B) codon-optimsed M sequences. 

FIGURE 108: Immunofluorescence of (A) native and (B) codon optimsed E sequences. 

FIGURES 109-1 1 1 : Western blots of Vero cells using rabbit antibodies obtained after . 
immunization with spike proteins expressed in Exoli. 

15 FIGURE 1 12: Spike protein expression in 293 cells. Lanes: (M) Markers; (1) Mock transfected; 
(2,6) cells expressing nS protein, lysate; (3,7) cells expressing nSdTC protein, lysate; (4,8) cells 
expressing nS protein, supernatant; (5,9) (4) cells expressing nSdTC protein, supernatant 
Staining antibody: (2 to 5) mouse serum obtained after DNA immunization; (6 to 9) rabbit serum . 
obtained after immunization with whole killed virus. 

20 FIGURE 1 13: Six reading frames of SEQ ID NO: 9968. 

FIGURE 1 14: Six reading frames of SEQ ID NO: 10033. 

FIGURE 115: Alignment of bovine coronavirus pol lab (top row; SEQ ID NO: 10068), avian 
infectious bronchitis pol lab (second row; SEQ ID NO: 10069), murine hepatitis virus pol lab 
(third row; SEQ ID NO: 10070), SEQ ID NO s : 9997/9998 (fourth row) and a consensus 
25 sequence (bottom row; SEQ ID NO: 10071). 

FIGURE 116: Schematic of coronavirus genome organization. 

FIGURE 117: Schematic of coronavirus ORFla/ORFlb gene products, including region. 
FIGURE 118: Alignment. 

FIGURE 1 19: Alternative start codons within SEQ ID NO: 10080. 
30 FIGURE 120: Six reading frames of SEQ ID NO: 10084. 

FIGURE 121: Alignment of SEQ ID NO: 10033 and SEQ ID NO: 10084. 
FIGURE 122: Reading frames in SEQ ID NO: 10084. 
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FIGURE 123: Start codon analysis for SEQ ID NO: 10084. 
FIGURE 124: BLAST analysis of SEQ ID NO: 10210. 

FIGURE 125: Epitope analysis of SEQ ID NO: 10210 by either (13A) Hopp & Woods or (13B) 
Kyte&Doolittle. 

5 FIGURE 126: Reading frames in SEQ ID NO: 10299. 

FIGURE 127: Reading frames in SEQ ID NO: 10505. 

FIGURE 128: Reading frames in SEQ ID NO: 11563. 

FIGURE 129: Reading frames in SEQ ID NO: 10033. 

FIGURE 130: Alignment of SEQ ID NO: 9997 and SEQ ID NO: 10033. 
10 FIGURE 131: Reading frames in SEQ ID NO: 10299. 

FIGURE 132: Reading frames in SEQ ID NO: 10505. 

FIGURE 133: Western Blot of SARS protease purification fractions. 

FIGURE 134: Cleavage of DABCYL-EDANS (a fluorescent tagged peptide with a SARS 

protease cleavage site) by SARS protease at different concentrations. The graph shows 
15 activity/concentration correlations with no protease (♦), 0.95 uM protease (■) and 2.86 uM 

protease (•). 

In the event of a discrepancy between a sequence in the sequence listing and a sequence 
in the drawings, the drawings should take precedence. 

DETAILED DESCRIPTION OF THE INVENTION 

20 The practice of the present invention will employ, unless otherwise indicated, conventional 

methods of chemistry, biochemistry, molecular biology, immunology and pharmacology, within 
the skill of the art. Such techniques are explained fully in the literature. See, e.g., Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa., 19th Edition (1995); Methods 
In Enzymology (S. Colowick and N. Kaplan, eds., Academic Press, Lie); and Handbook of 

25 Experimental Immunology, Vols. I-IV (D.M. Weir and C.C. Blackwell, eds., 1986, Blackwell 
Scientific Publications); Sambrook, et dL, Molecular Cloning: A Laboratory Manual (2nd 
Edition, 1989); Handbook of Surface and Colloidal Chemistry (Birdi, K.S. ed, CRC Press, 
1997); Short Protocols in Molecular Biology, 4th ed. (Ausubel et aL eds., 1999, John Wiley & 
Sons); Molecular Biology Techniques: An Intensive Laboratory Course, (Ream et aL, eds., 1998, 

30 Academic Press); PCR (Introduction to Biotechniques Series), 2nd ed. (Newton & Graham eds., 
1997, Springer Verlag); Peters and Dalrymple, Fields Virology (2d ed), Fields et aL (eds.), BJM. 
Raven Press, New York, NY. 
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AD publications, patents and patent applications cited herein, are hereby incorporated by 
reference in their entireties. 

Severe Acute Respiratory Syndrome (SARS) virus has recently been identified as a new 
viral species. The SARS viral species includes the following isolates. 
5 - two virus isolates described in Peiris et aL "Coronavirus as a possible cause of severe acute 
respiratory syndrome" Lancet published online at 

http:/Amage. thekmcet com/extras/03art3477web.pdf on April 8 2003, incorporated herein by 
reference in its entirety and the sequences deposited with GenBank at accession number 
AY268070. 

10 - the isolates and viral sequences described in Drosten et al. , 'Identification of a Novel 

Coronavirus in Patients with Severe Acute Respiratory Syndrome", New England Journal 
of Medicine, published online at http://www.nejm.org on April 10, 2003. 

- the isolates and viral sequences described on the website of the WHO network on March 25 
and 24, 2003. 

15 - the isolates and viral sequences described in Tsang et al. , "A Cluster of Cases of Severe 

Acute Respiratory Syndrome in Hong Kong" New England Journal of Medicine, published 
online at http://www.nejm.org on March 31, 2003. 

- the isolates and viral sequences described in Poutanen et al. , 'Identification of Severe 
Acute Respiratory Syndrome in Canada" New England Journal of Medicine, published 

20 online at http://www.nejm.org on March 31, 2003. 

As described in the Lancet article, a 646 base pair polynucleotide from the SARS virus has weak 
homology to viruses of the family Cornoaviridae. The Lancet article further reports that a 
deduced amino acid sequence (of 215 amino acids) from this sequence has about 57% sequence 
homology to the RNA polymerase of bovine coronavirus and murine hepatitis virus. 

25 Phylogenetic analysis of the protein sequences are also presented in the Lancet article showing 
that the polymerase sequence is most closely related to the group II coronaviruses. 

Additional SARS viral isolates can be identified, isolated and/or sequenced by virologists 
skilled in the art Virologists can readily identify new viral isolates as a SARS virus. Criteria 
which a virologist may use to identify new SARS isolates include: sequence homology of the 

30 new isolate to known SARS viral isolates; similar genomic organization of the new viral isolate 
to known SARS viral isolates; immunological (serologic) similarity or identity with known 
SARS viral isolates; pathology; and similarity of virion morphology with known SARS viral 
isolates; and similarity of infected cell morphology as that caused by known SARS viral isolates 
(visualized, for instance, by electron microscopy). 

35 Methods for isolating and sequencing SARS viral isolates include the methods described 

by Peiris et al. in the Lancet paper. As reported in the Lancet paper, RNA from clinical samples 
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can be reverse transcribed with random hexamers and cDNA can be amplified with primers 
having sequences of SEQ ID NOS: 6584 & 6585 in the presence of 2.5 mmol/L magnesium 
chloride (94°C for 1 min, 50°C for 1 min, and 72°C for 1 min). 

Reverse transcription of a viral isolate using random hexamers can be accomplished in an 
5 RT-PCR assay as follows. Virus isolates are propagated on mammalian cells, particularly fetal 
rhesus kidney cells. Total RNA from virus-infected and viras-uninfected fetal rhesus kidney 
cells is then isolated RNA samples are reverse transcribed with a primer having SEQ ID NO: 
6586. cDNA can be amplified by a primer having SEQ ID NO: 6587. Unique PCR products (in 
size) in the infected cell preparation are then cloned and sequenced, and genetic homology of the 

10 sequence compared with those in GenBank. 

One skilled in the art would be able to identify and clone additional genomic regions using 
a variety of standard cloning techniques, such as, for example, using random primer RT-PCR 
and detection of sequences overlapping one or more of the above sequences, and/or using 
oligonucleotide primers, e.g., degenerate primers, based on the sequences provided herein (see 

15 Figures 1-5, Figures 8-11, SEQ ID NOS: 3-20). 

Cloning, sequencing and identification of SARS virus by one skilled in the art can be 
further facilitated by the use of polynucleotide sequences, particularly RNA polymerase 
sequences, from related Coronaviruses. 

Sequence homology of new viral isolates with the known SARS isolates described above 

20 can be readily determined by one skilled in the art New SARS isolates may be identified by a 
percent homology of viral nucleotide sequences of 99%, 95%, 92%, 90%, 85%, or 80% 
homology of the new virus to known SARS viral polynucleotide sequences. New SARS isolates 
may also be identified by percent homology of 99%, 95%, 92%, 90%, 85%, or 80% homology of 
: the polypeptides encoded by the polynucleotides of the new virus and the polypeptides encoded 

15 by known SARS virus. 

New SARS isolates may also be identified by a percent homology of 99%, 95%, 92%, 
90%, 85%, or 80% homology of the polynucleotide sequence for specific genomic regions for 
the new virus with the polynucleotide sequence for specific genomic regions of the known SARS 
viruses. Additionally, new SARS isolates may be identified by a percent homology of 99%, 

JO 95%, 92%, 90%, 85%, or 80% homology of the polypeptide sequence encoded by the 

polynucleotide of specific genomic regions of the new SARS virus to the polypeptide sequence 
encoded by the polynucleotides of specific regions of the known SARS virus. These genomic 
regions may include regions (e.g. 9 gene products) which are typically in common among 
numerous coronaviruses, as well as group specific regions (e.g., antigenic groups), such as, for 

»5 example, any one of the following genomic regions which could be readily identified by a 
virologist skilled in the art: 5'untranslated region (UTR), leader sequence, ORFla, ORFlb, 
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nonstructural protein 2 (NS2), hemagglutinin-esterase glycoprotein (HE) (also referred to as B3), 
spike glycoprotein (S) (also referred to as B2), ORF3a, ORF3b, ORF3x, nonstructural protein 4 
(NS4), envelope (small membrane) protein (E) (also referred to as sM), membrane glycoprotein 
(M) (also referred to as El), ORFSa, ORF5b, nucleocapsid phosphoprotein (N), ORF7a, ORF7b, 
5 intergenic sequences, 3'UTR, or RNA dependent RNA polymerase (pol). The SARS virus may 
have identifiable genomic regions with one or more the above-identified genomic regions. A 
SARS viral antigen includes a protein encoded by any one of these genomic regions. A SARS 
viral antigen may be a protein or a fragment thereof, which is highly conserved with 
coronaviruses. A SARS viral antigen may be a protein or fragment thereof, which is specific to 
10 the SARS virus (as compared to known comaviruses). (See, Figures 1-5, Figures 8-1 1, SEQ ID 
NOS: 3-20). 

One skilled in the art could also recognize electron microscopy of a SARS virus infected 
mammalian cell. Electron microscopy of SARS infected cells are shown in the Lancet paper. 
As discussed in the paper, electron.microscopy of negative stained (3% potassium phospluy- 

15 tungstate, pH 7.0) ultracentrifuged cell-culture extracts of SARS infected fetal rhesus kidney 
cells show the presence of pleomorphic enveloped vims particles of around 80-90 nm (range 70- 
130 nm) in diameter with surface morphology compatible with a coronavirus (see Lancet paper, 
Figure 1). Thin-section electron microscopy of infected cells reveals virus particles of 55-90 nm 
diameter within smooth walled vesicles in the cytoplasm (see Lancet paper, Figure 2B). 

10 Electron microscopy can also be used to observe virus particles at the cell surface. Electron 
microscopy of a human lung biopsy sample depicts similar viral morphology. See Lancet paper 
Figure 2A. 

/. SARS POLYPEPTIDES AND POLYNUCLEOTIDES 

The invention relates to nucleic acids and proteins from SARS virus. Such polynucleotides 
15 and polypeptides are exemplified further below. 

In one embodiment, the polynucleotides of the invention do not include one of the 
following five primers, disclosed at http://contera.n^n.org/cgi/reprint/N^ SEQ ID 

NOS: 6034-38. 

The invention also includes polynucleotide sequences which can be used as probes for 
JO diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify the presence or absence of a SARS virus in a biological sample. The invention 
includes a polynucleotide sequence comprising one or more of the primer sequences identified in 
SEQ ID NOS: 21-1020. The invention further includes polynucleotide sequence comprising the 
complement of one or more of the primer sequences identified in SEQ ID NOS: 21-1020. 
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Hie invention includes a polypeptide sequence comprising an amino acid sequence from 
the sequence shown in Figure 23. Such amino acid sequences are SEQ ID NOS: 6588-6809. Hie 
invention includes polypeptides comprising an amino acid sequence having sequence identity to 
these sequences, and the invention includes a fragment of a polypeptide comprising one of these 
5 sequences. 

Hie invention includes a polypeptide comprising an amino acid sequence from the 
sequence shown in Figure 24. Such amino acid sequences are SEQ ID NOS: 6810-7179. Hie 
invention includes a protein comprising an amino acid sequence having sequence identity to 
these sequences, and the invention includes a fragment of a protein comprising one of these 
10 sequences. 

The invention includes a protein comprising SEQ ID NO: 6039. Hie invention includes a 
polypeptide comprising an amino acid sequence having sequence identity to SEQ ID NO: 6039. 
Hie invention includes a fragment of a polypeptide comprising SEQ ID NO: 6039. Hie 
invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 6039, or a 

15 fragment thereof. Hie invention includes a diagnostic kit comprising a polynucleotide sequence- 
encoding SEQ ID NO: 6039, or a fragment thereof. The invention includes an immunogenic 
composition comprising a polypeptide comprising SEQ ID NO: 6039, or a fragment thereof. 
Hie invention includes an antibody which recognizes a polypeptide comprising SEQ ID NO: 
6039, or a fragment thereof. SEQ ID NO: 6039 demonstrates functional homology with ORFla 

20 of coronaviruses. 

Predicted transmembrane or hydrophobic regions of SEQ ID NO: 6039 are identified 
below. Although the polyprotein of coronaviruses is proteolytically cleaved into numerous 
smaller proteins, hydrophobic domains in the polyprotein are known to mediate the membrane 
association of the replication complex and to be able to dramatically alter the architecture of host 

25 cell membranes. Accordingly, the hydrophobic domains of the polyprotein are targets for 

genetic mutation to develop attenuated SARS virus vaccines. The hydrophobic domains are also 
targets for small molecule inhibitors of the SARS virus. The hydrophobic domains may also be 
used to generate antibodies specific to those regions to treat or prevent SARS virus infection. 
Predicted Transmembrane Helices in SEQ ID NO: 6039 

30 

The sequence positions in brackets denominate the core region. 
Only scores above 500 are considered significant. 

Inside to outside helices : 43 found Outside to inside helices : 43 found 
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Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6039, wherein said fragment comprises an amino acid sequence including one or more of the 
hydrophobic transmembrane sequences identified above. Hie invention includes a polypeptide 
5 comprising a fragment of SEQ ID NO: 6039 wherein said fragment comprises one or more of the 
following polypeptide sequences of SEQ ID NO: 6039: 473-488, 529-549, 584-606, 773-791, 
2098-2119, 2145-2160, 2206-2224, 2316-2332, 2335-2358, 2373-2390, 2753-2770, 2831-2854, 
2879-2900, 2990-3012, 3024-3042, 3054-3075, 3105-3127, 3438-3455, 3559-3584, 3589-3606, 
3611-3629, 3659-3674, 3756-3777, 473-488, 583-606, 776-791, 2098-2119, 2208-2231, 2309- 

10 2332, 2342-2368, 2753-2770, 2832-2854, 2990-3005, 3020-3042, 3059-3075, 3105-3127, 3142- 
3162, 3437-3453, 3560-3577, 3591-3606, 3610-3627, 3656-3678, 3710-3738, 3723-3738, and 
3760-3777. Preferably, the fragment comprises one or more of the following polypeptide 
sequences of SEQ ID NO: 6039: 2206-2224, 2316-2332, 2335-2358, 2753-2770, 3024-3042, 
3054-3075, 3105-3127, 3589-3606, 3611-3629, 3756-3777, 2208-2231, 2753-2770, 3020-3042, 

15 3059-3075, and 3591-3606. Preferably, the fragment comprises one or more of the following 
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polypeptide sequences of SEQ ID NO: 6039: 2206-2224 and 3020-3042. ITie invention also 
includes polynucleotides encoding each of the polypeptide fragments identified above. 

The invention includes an attenuated SARS virus wherein said attenuated SARS virus 
contains an addition, deletion or substitution in the polynucleotides encoding for one of the 
5 hydrophobic domains identified above. The invention also includes a method for creating an 
attenuated SARS virus comprising mutating a SARS virus by adding, deleting or substituting the 
viral genome of the SARS virus to alter the coding of one or more of the hydrophobic domains 
of SEQ ID NO: 6039 identified above. 

The invention includes an antibody which specifically identifies one or more of the 
10 hydrophobic regions of SEQ ID NO: 6039 identified above. The invention includes a small 

molecule which binds to, interferes with the hydrophobicity of or otherwise disrupts one or more 
of the hydrophobic regions of SEQ ID NO: 6039 identified above. 

Predicted N-glycosylation sites of SEQ ID NO: 6039 are identified in the chart below. 
Prediction of N-glycosylation sites in SEQ ID NO: 6039 



15 



Position Potential Jury NGlyc 

agreement result 



48 NGTC SEQ ID NO: 7180 0.6371 (7/9) + 

.389 NHSN SEQ ID NO: 7181 0.6132 (6/9) + 

20 916 NFSS SEQ ID NO: 7182 0.5807 (7/9) + 

1628 NHTK SEQ ID NO: 7183 0.5610 (7/9) + 

1696 NKTV SEQ ID NO: 7184 0.5297 (5/9) + 

2031 NPTI SEQ ID NO: 9764 0.5299 (5/9) + WARNING: PRO- 

XI. 

IS 2249 NSSN SEQ ID NO: 7185 0.6329 (9/9) ++ 

2459 NPTD SEQ ID NO: 9765 0.5599 (6/9) + WARNING: PRO- 

XI. 

2685 NVSL SEQ ID NO: 7186 0.6071 (8/9) + 

4233 NATE SEQ ID NO: 7187 0.6144 (7/9) + 

30 

Accordingly, the invention comprises a fragment of SEQ ID NO: 6039 wherein said 
fragment comprises an amino acid sequence which includes one or more of the N-glycosylation 
sites identified above. Preferably, the fragment comprises one or more sequences selected from 
the group consisting of SEQ ID NOS: 7180-7187 & 9764-9765. Preferably, the fragment 
J5 comprises the amino acid sequence NSSN (SEQ ID NO: 7185). 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6039 wherein 
said fragment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6039 are identified in Table 13. The invention includes a 
\0 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified as SEQ ID 
NOS: 7400-7639; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
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polypeptides of (a) or (b). Hie invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified as SEQ 
ID NOS: 7400^7639, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a S ARS 
virus. 

The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The ORFla and ORFlb sequences of coronaviruses are typically translated as a single 
ORFlab polyprotein. Slippage of the ribosome during translation generates an a-1 frameshift 
One region of such slippage is illustrated below: 

gggttttacacttagaaacacagtctgtaccgtctgcggaatgtggaaaggttatggctgtagttgtga 
+1 GFTLRNTVCTVCGMWKGYGC S'CD 
+3GPYT-KHSLYRLRNVERLWL-I/- 

ccaactccgcgaacccttgatgcagtctgcggatgcatcaacgtttttaaacgggtttgcggtgtaagt 
+1 QLREPLMQSADASTFLNGFAV-V 
+3 PTPRTLDAVCGC INVFKRVCGVS 

gcagcccgtcttacaccgtgcggcacaggcactagtactg (SEQ ID NO: 7224) 
+1 QPVLHRAAQA LVL (SEQ ID NOS: 7225-7226) 
+3AARLTPCGTGTST (SEQ ID NOS: 7227-7229) 

which would generate the following translational slippage (SEQ ID NOS: 7230-7231): 

ccaactccgcgaacccttgatgcagtctgcggatgcatcaacgtttttaaacgggtttgcggtgtaagt 
QLREPLMQ SADASTFLNRVCGVS 

Accordingly, the invention includes a polypeptide comprising SEQ ID NO: 7232. The 
invention includes a polypeptide comprising an amino acid sequence having sequence identity to 
SEQ ID NO: 7232. The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
7232 The invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 
7232 or a fragment thereof. Hie invention includes a diagnostic kit comprising a polynucleotide 
sequence encoding SEQ ID NO: 7232 or a fragment thereof. The invention includes an 
immunogenic composition comprising a polypeptide comprising SEQ ID NO: 7232 or a 
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fragment thereof. Hie invention includes an antibody which recognizes a polypeptide 
comprising SEQ ID NO: 7232 or a fragment thereof. 

The invention also includes a polypeptide comprising amino acid sequence X1-X2-X3, 
where Xi is SEQ ID NO: 7233, X 2 is from one to ten amino acids, and X 3 is SEQ ID NO: 7234. 

5 X2 can comprise any sequence of one to ten amino acids (SEQ ID NOS: 7235-7244) but, in 
preferred embodiments, X2 is selected from the group consisting of F, EL, ELN, FLNR (SEQ ID 
NO: 7245), ELNRV (SEQ ID NO: 7246) and FLNRVC (SEQ ID NO: 7247). Preferably, X 2 is 
SEQ ID NO: 7247. Hiese preferred embodiments are shown as SEQ ID NOS: 7248-7253. 

The invention includes a polypeptide comprising an amino acid sequence having sequence 

10 identity to said amino acid sequences X1-X2-X3. The invention includes a fragment of a 
polypeptide comprising said amino acid sequences X1-X2-X3. The invention includes a 
diagnostic kit comprising a polypeptide comprising said amino acid sequences X1-X2-X3 or a 
fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence 
encoding said amino acid sequences X1-X2-X3 or a fragment thereof. The invention includes an 

15 immunogenic composition comprising a polypeptide comprising said amino acid sequences 
X1-X2-X3 or a fragment thereof. Hie invention includes an antibody which recognizes a 
polypeptide comprising said amino acid sequences X1-X2-X3 or a fragment thereof. 

The amino acid sequences Xi-X 2 -X 3 (le. SEQ ID NOS: 7235-7244) demonstrate 
functional homology with the polyprotein of murine hepatitis virus. This polyprotein is cleaved 

20 to produce multiple proteins. Proteins which can be generated from the X1-X2-X3 polyprotein, 
where X 2 is six amino acids (SEQ ID NO: 7240) are listed below. 



Mouse virus protein 
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Hie invention includes a fragment of the amino acid sequence X1-X2-X3 (i.e. SEQ ID 

NOS: 7235-7244) wherein the fragment comprises one of the polypeptide sequences identified in 

25 the above table. The invention further includes a fragment of the amino acid sequence X1-X2-X3 

wherein said fragment comprises a polypeptide sequence which has a serine at its N-terminus 

and a glutamine at its C-terminus. The invention further includes a fragment of the amino acid 

sequence X1-X2-X3 wherein said fragment comprises a polypeptide sequence which has an 
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Alanine at its N-terminus and a glutamine at its C-terminus. The invention further includes a 
fragment of the amino acid sequence X1-X2-X3 wherein said fragment comprises a polypeptide 
sequence which has a Asparaglne at its N-terminus and a glutamine at its C-terminus. Hie 
invention further includes a fragment of the amino acid sequence X1-X2-X3 wherein said 
5 fragment comprises a Cysteine at its N-terminus and a Glutamine at its C-terminus. Each of the 
fragments identified above can be used in fusion proteins. 

The invention includes a diagnostic kit comprising a polypeptide comprising at least one of 
the fragments of the amino acid sequence Xi-X r X 3 (Le. SEQ ID NOS: 7235-7244) identified in 
the above paragraph. The invention includes a diagnostic kit comprising a polynucleotide 

10 sequence encoding at least one of the fragments of the amino acid sequence X1-X2-X3 identified 
in the above paragraph. The invention includes an immunogenic composition comprising a 
polypeptide comprising at least one of the fragments of the amino acid sequence X1-X2-X3 
identified in the above paragraph. The invention includes an antibody which recognizes a 
" polypeptide comprising at least one of the fragments of the amino acid sequence X1-X2-X3 

15 identified in the above paragraph. • 
Predicted N-glycosylation sites of the amino acid sequence X1-X2-X3 when X2 is six amino 
acids are identified at the asparagines located at the following amino acid positions 48; 389; 556; 
916; 1628; 1696; 1899; 2079; 2249; 2252; 2507; 2685; 3303; 3373; 3382; 3720; 4150; 4233; 
4240; 5016; 5280; 5403; 5558; 5650; 5905; 6031; 6130; 6474; 6918; 6973. Accordingly, the 

20 invention comprises a fragment of SEQ ID NO: 7239 wherein said fragment is at least ten amino 
acids and wherein said fragment comprises one or more of the asparagines from the amino acid 
positions of SEQ ID NO: 7239 selected from the group consisting of 8; 389; 556; 916; 1628; 
1696; 1899; 2079; 2249; 2252; 2507; 2685; 3303; 3373; 3382; 3720; 4150; 4233; 4240; 5016; 
5280; 5403; 5558; 5650; 5905; 6031; 6130; 6474; 6918; and 6973. 

25 A zinc binding region 2 site within SEQ ID NOS: 7235-7244 is identified at amino acid 

residues 2102-2112 (SEQ ID NO: 7254 HGIAAINSVPW). The polypeptide of SEQ ID NOS: 
7235-7244 will be processed by the SARS virus into multiple peptides. This zinc binding region 
falls within the nspl region of the polypeptide. SEQ ID NO: 7254 is a target for screening of 
chemical inhibitors to the SARS virus. The invention includes a polypeptide comprises SEQ ID 

30 NO: 7254. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ 
ID NO: 7254. The invention includes a method of screening SEQ ID NO: 7254 for an inhibitor. 
The invention includes the recombinant expression of SEQ ID NO: 7254 in a host cell. The 
invention includes a fragment of SEQ ID NOS: 7235-7244, wherein said fragment comprises 
SEQ ID NO: 7254. The invention includes a polypeptide comprising SEQ ID NO: 7254 wherein 

35 said polypeptide is complexed with a zinc ion. The invention includes a small molecule which 
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• prevents a zinc ion from complexing with the polypeptide of SEQ ID NO: 7254. Hie invention 
includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 7254, 

Hie polyprotein encoded by the SARS virus will contain at least two protease domains: a 
papain-like cystein protease (PLP) and a chymotrypsin-picomavirus 3C-like protease (3CLp). 

5 (There may be more than one copy of the PLP domain). These proteases function to cleave the 
polyprotein into multiple smaller proteins. Hie 3C-like protease, also known as the "main 
protease" or Mpro, is itself cleaved from the polyprotein by its own autoprotease activity. See 
generally, Chapter 35 of Fields Virology (2nd ed), Fields et aL (eds.), B.N. Raven Press, New 
York, NY, and Anand et aL, EMBO Journal (2002) 21 (13): 3213-3224. This 3CLp generally 

10 corresponds with the Nsp2 region identified above. 

The SARS virus 3CLp protein is further characterized by SEQ ID NO: 6569 (also SEQ ID 
NO: 9769), as shown in FIGURE 15. 

FIGURE 16 also illustrates the SARS virus 3CLp, in allignment with the 3CLp of avian 
infectious bronchitis (IBV; SEQ ID NO: 6570), mouse hepatitis virus (MHV; SEQ ID NO: 

15 6571), and bovine coronavirus (BCoV; SEQ ID NO: 6572). Accordingly, the invention includes 
a polypeptide sequence comprising SEQ ID NO: 6569, or a fragment thereof, or a polypeptide 
sequence having sequence identity thereto. The invention further includes a polynucleotide 
sequence encoding SEQ ID NO: 6569, or a fragment thereof. The invention includes a 
polynucleotide sequence encoding a polypeptide sequence having sequence identity to SEQ ID 

20 NO: 6569. 

The invention further includes a method of screening for an inhibitor of the SARS virus 
3CLp protein. In one embodiment, the invention includes a method of screening for an inhibitor 
of SEQ ID NO: 6569. The invention includes a method of recombinantly expressing the SARS 
virus 3CLp protein in a host cell. Hie invention includes a method of recombinantly expressing 

25 a polypeptide sequence comprising SEQ ID NO: 6569 or an enzymatically active fragment 
thereof or a polypeptide sequence having sequence identity thereto. The invention includes a 
small molecule which inhibits or reduces the proteolytic activity of the SARS virus 3CLp 
protein. The invention includes a small molecule which inhibits or reduced the proteolytic 
activity of the polypeptide comprising SEQ ID NO: 6569. 

30 Catalytic residues of the SARS virus 3CLp are identified in FIGURE 15 and 16. 

Specifically, a catalytic histidine and a catalytic cysteine are identified. Such catalytic sites are 
targets for small molecules which could inhibit or reduce the protease activity of 3CLp. 
Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 6569, 
wherein said fragment comprises at least one catalytic site. Preferably, the catalytic site is 

35 selected from the group consisting of the indicated catalytic histidine and the catalytic cysteine in 
FIGURE 15 and 16. The invention includes a polynucleotide encoding a polypeptide, wherein 
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said polypeptide comprises a fragment of SEQ ID NO: 6569, wherein said fragment comprises at 
least one catalytic site. Preferably, the catalytic site is selected from the group consisting of the 
indicated catalytic histidine and the catalytic cysteine. 

Hie invention further includes a method of screening a compound library to identify a 
5 small molecule which inhbits a catalytic site of a SARS virus 3CLp. Preferably, the 3CLp 
comprises SEQ ID NO: 6569, or a fragment thereof, or a sequence having sequence identity 
thereto. The catalytic site is preferably selected from the group consisting of the indicated 
catalytic histidine and the catalytic cysteine in FIGURE 15 and 16. 

The invention includes a small molecule which inhibits the catalytic site of a SARS virus 

10 3CLp. Preferably, the 3CLp comprises SEQ ID NO: 6569, or a fragment thereof, or a sequence 
having sequence identity thereto. Hie catalytic site is preferably selected from the group 
consisting of the indicated catalytic histidine and the catalytic cysteine in FIGURE 15 and 16. 

Residues of the substrate site of the SARS virus 3CLp are identified in FIGURE 15 aijd 16. 
Specifically, a substrate site is indicated at a phenylalanine, a tyrosine and a histidine. Such 

15 substrate sites are targets for small molecules which could inhibit or reduce the protease activity 
of 3CLp. Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID 
NO: .6569, wherein said fragment comprises at least one substrate site. Preferably, the substrate 
site is selected from the group consisting of the indicated substrate phenylalanine, tyrosine and 
histidine in FIGURE 15 and 16. The invention includes a polynucleotide encoding a 

20 polypeptide, wherein said polypeptide comprises a fragment of SEQ ID NO: 6569, wherein said 
fragment comprises at least one substrate site. Preferably, the substrate site is selected from the 
group consisting of the indicated substrate phenylalanine, tyrosine and histidine in FIGURE 15 
and 16. 

The invention further includes a method of screening a compound library to identify a 
25 small molecule which blocks a substrate site of a SARS virus 3CLp. Preferably, the 3CLp 
comprises SEQ ID NO: 6569, or a fragment thereof, or a sequence having sequence identity 
thereto. The substrate site is preferably selected from the group consisting of the indicated 
substrate phenylalanine, tyrosine and histidine in FIGURE 15 and 16. 

The invention includes a small molecule which inhibits the substrate site of a SARS virus 
30 3CLp. Preferably, the 3CLp comprises SEQ ID NO: 6569, or a fragment thereof, or a sequence 
having sequence identity thereto. The substrate site is preferably selected from the group 
consisting of the indicated substrate phenylalanine, tyrosine and histidine in FIGURE 15 and 16. 

The invention further includes a diagnostic kit comprising a polynucleotide encoding a 
SARS virus 3CLp or a fragment thereof. Preferably, the SARS virus 3CLp comprising SEQ ID 
35 NO: 6569 or a fragment thereof or a polypeptide sequence having sequence identity thereto. 
Preferably, the fragment comprising one or more sites selected from the group consisting of a 

-27- 



WO 2004/092360 



PCT/US2004/011710 



catalytic site and a substrate site. Preferably, the catalytic site is selected from the group 
consisting of one or more of the sites identified in FIGURE 15 and 16. Preferably, the substrate 
site is selected from the group consisting of one or more of the sites identified in FIGURE 15 
and 16. 

5 Hie invention further comprises a diagnostic kit comprising an antibody specific to a 

SARS virus 3CLp or a fragment thereof. Preferably, the antibody is specific to the polypeptide 
comprising SEQ ID NO: 6569 or a fragment thereof or a polypeptide sequence having sequence 
identity thereto. Preferably, the antibody is specific to one or more sites of a SARS virus 3CLp 
selected from the group consisting of a catalytic site and a substrate site. Preferably, the catalytic 

10 site is selected from the group consisting of one or more of the sites identified in FIGURE 15 
and 16. Preferably, the substrate site is selected from the group consisting of one or more of the 
sites identified in FIGURE 15 and 16. 

Hie invention includes a polypeptide comprising an amino acid sequence from the 
sequence shown in Figure 25. Hie two amino acid sequences within Figure 25, separated by a *, 

15 are SEQ ID NOS: 7188 & 7189. The invention includes a polypeptide comprising an amino acid 
sequence having sequence identity to the figure 25 translation. The invention includes a 
fragment of a polypeptide comprising the figure 25 sequence. The invention includes a 
diagnostic kit comprising a polypeptide comprising the figure 25 translation, or a fragment 
thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence encoding < 

20 the figure 25 translation, or a fragment thereof. The invention includes an immunogenic 
composition comprising a polypeptide comprising the figure 25 translation, or a fragment 
thereof. The invention includes an antibody which recognizes a polypeptide comprising the 
figure 25 sequence, or a fragment thereof. Hie figure 25 sequence demonstrates functional 
homology with ORFlb of coronaviruses. 

25 SEQ ID NO: 7188 is an open reading frame within Figure 25. The invention includes a 

polypeptide comprising SEQ ID NO: 7188. The invention includes a polypeptide comprising an 
amino acid sequence having sequence identity to SEQ ID NO: 7188. The invention includes a 
fragment of a polypeptide comprising SEQ ID NO: 7188. The invention includes a diagnostic 
kit comprising a polypeptide comprising SEQ ID NO: 7188, or a fragment thereof. The 

30 invention includes a diagnostic kit comprising a polynucleotide sequence encoding SEQ ID NO: 
71 88, or a fragment thereof. The invention includes an immunogenic composition comprising a 
polypeptide comprising SEQ ID NO: 7188, or a fragment thereof. The invention includes an 
antibody which recognizes a polypeptide comprising SEQ ID NO: 7188, or a fragment thereof. 
SEQ ID NO: 7190 is an open reading frame within SEQ ID NO: 7188. Hie invention 

35 includes a polypeptide comprising SEQ ID NO: 7190, a fragment thereof or a polypeptide 

having sequence identity thereto. The invention further includes a polynucleotide encoding SEQ 
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ID NO: 7190, a fragment thereof or a polypeptide sequence having sequence identity thereto. 
An example of a polynucleotide encoding SEQ ID NO: 7190 is given as SEQ ID NO: 7191. 

SEQ ID NO: 7188 also contains an open reading frame comprising SEQ ID NO: 6042. 
Hie invention includes a polypeptide comprising SEQ ID NO: 6042. Hie invention includes a 
5 polypeptide comprising an amino acid sequence having sequence identity to SEQ ID NO: 6042. 
The invention includes a fragment of a polypeptide comprising SEQ ID NO: 6042. The 
invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 6042, or a 
fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide sequence 
encoding SEQ ID NO: 6042, or a fragment thereof. The invention includes an immunogenic 
10 composition comprising a polypeptide comprising SEQ ID NO: 6042, or a fragment thereof. 
The invention includes an antibody which recognizes a polypeptide comprising SEQ ID NO: 
6042, or a fragment thereof. SEQ ID NO: 6042 demonstrates functional homology to a 
coronavirus spike protein. 

Predicted transmembrane regions of SEQ ID NO: 6042 are identified below. 

15 Predicted. Transmembrane helices of SEQ ID NO: 6042 

The sequence positions in brackets denominate the core region. 
Only scores above 500 are considered significant. 

Inside to outside helices : 18 found Outside to inside helices : 13 found 
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20 

SEQ ID NO: 6042, the spike protein, is a surface exposed polypeptide. Recombinant 
expression of a protein can be hindered by hydrophobic transmembrane regions. Accordingly, 
the invention includes a polypeptide comprising SEQ ID NO: 6042 wherein one or more of the 
hydrophobic regions identified above is removed. The invention further includes a 

25 polynucleotide encoding such a polypeptide. The invention includes recombinantly expressing 
the protein in a host cell. Primers for amplifying the gene for spike protein and fragments 
thereof, such as fragments encoding the soluble ectodomain, include SEQ ID NOS: 9753-9763 
(Xiao et al (2003) Biochem Biophys Res Comm 312:1159-1164). 

Further characterization of SEQ ID NO: 6042 is set forth below. 

30 PSOR3P Prediction of Protein Localization Sites 
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version 6.4 (WWW) 

SEQ ID NO: 6042 - 1255 Residues 

Species classification: 4 

5 *** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0-5) 
count : 2 

Position of the most N-terminal TMS: 496 at i=2 
10 MTOP: membrane topology (Hartmann et al.) 

I (middle): 503 Charge diffirence (ON) : 1.0 
McG: Examining signal sequence (McGeoch) 
Length of UR: 13 
Peak Value of UR: 3.28 
15 Net Charge of CR: 0 

Discriminant Score : 8.66 
GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): 5.94 
Possible cleavage site: 13 
20 >» Seems to have a cleavable N-term signal seq. 

Amino Acid Composition of Predicted Mature Form: 

calculated from 14 
ALOM new cnt: 1 ** thrshld changed to -2 
Cleavable signal was detected in ALOM? : 0B 
25 ALOM: finding transmembrane regions (Klein et al.) 

count: 1 value: -12.26 threshold: -2.0 

INTEGRAL Likelihood =-12.26 Transmembrane 1202-1218 (1194-1228) 
PERIPHERAL Likelihood = 0.16 

modified ALOM score: 2.55 f 
30 »> Seems to be a Type la membrane protein 

The cytoplasmic tail is from 1219 to 1255 (37 Residues) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 
35 (14) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 14 
Uncleavable? Ipos set to: 24 
Discrimination of mitochondrial target seq. : 
40 positive (2.18) 

Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

45 *** Reasoning Step: 2 

• KDEL Count: 0 
Checking apolar signal for intramitochondrial sorting 
(Gavel position 24) from: 1 to: 10 Score: 8.0 
50 SKL motif (signal for peroxisomal protein): 
pos: 964(1255), count: 1 SRL 
SKL score (peroxisome): 0.1 
Amino Acid Composition Tendency for Peroxisome: 1.37 
AAC not from the N-term. , score modified 
55 Peroxisomal proteins? Status: notclr 

AAC score (peroxisome): 0.079 
Amino Acid Composition tendency for lysosomal proteins 

score: 0.39 Status: notclr 
GY motif in the tail of typela? (lysosomal) 
60 Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
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5 



Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 
Checking the RNA binding motif (nucleus or cytoplasm) 
Nuclear Signal Status: negative ( 0.00) 
Type la is favored for plasma memb. proteins 
Checking the NPXY motif. . 
Checking the YXRF motif. . 
Checking N-myristoylation. . 



10 



Final Results 



plasma membrane Certainty= 0.4 60 (Affirmative) < suco 

microbo^y (peroxisome) Certainty^ 0.171 (Affirmative ) < suco 

endoplasmic reticulum (membrane) Certainty^ 0.100 (Affirmative) < suco 

endoplasmic reticulum (lumen) Certainty^ 0 .100 (Affirmative) < suco 



SEQ ID NO: 6042 appears to have a N-terminus signaling region, followed by a surface 
exposed region, followed by a transmembrane region followed by a C-terminus cytoplasmic 
domain region. Accordingly, the invention includes an immunogenic, surface exposed fragment 
of SEQ ID NO: 6042. Preferably, said fragment comprises an amino acid sequence which does 

20 not include the last 50 amino acids of the C-terminus of SEQ ID NO: 6042. Preferably, said 
fragment comprises an amino acid sequence which does not include the last 70 amino acids of 
the C-terminus of SEQ ID NO: 6042. Preferably, said fragment does not include a transdomain 
region of SEQ ID NO: 6042. Preferably, said fragment does not include a C-terminus 
cytoplasmic domain of SEQ ID NO: 6042. Preferably, said fragment does not include a N- 

25 terminus signal sequence. Preferably, said fragment does not include amino acids 1-10 of the N- 
terminus of SEQ ID NO: 6042. Preferably, said fragment does not include amino acids 1-14 of 
the N-terminus of SEQ ID NO: 6042. Two oligopeptide fragments of SEQ ID NO: 6042 that are . 
able to elicit anti-spike antibodies are SEQ ID NOS: 7398 & 7399, as described (with additional 
C-tenninus cysteines) by Xiao et al. (2003) Biochem Biophys Res Comm 312:1159-1164. 

30 C-terminal truncations of spike protein, with removal of part of the cytoplasmic region, or 
removal upto and including the transmembrane region, are described by Yang et aL (2004) 
Nature 428:561-564. 

A variant of SEQ ID NO: 6042 that is included within the invention is SEQ ID NO: 9962. 
Compared to SEQ ID NO: 6042, this sequence has Ser at residue 581 instead of Ala, and has Phe 

35 at residue 1 152 instead of Leu. 

The spike protein of coronaviruses may be cleaved into two separate chains into SI and S2. 
The chains may remain associated together to form a dimer or a trimer. Accordingly, the 
invention includes a polypeptide comprising SEQ ID NO: 6042 wherein said polypeptide has 
been cleaved into SI and S2 domains. Hie invention further includes a polypeptide comprising 

40 SEQ ID NO: 6042 wherein amino acids 1-10, preferably amino acids 1-14 of the N-terminus are 
removed and further wherein SEQ ID NO: 6042 is cleaved into SI and S2 domains. Preferably 
the polypeptide is in the form of a trimer. 



15 
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The spike protein appears to form an alpha-helical structure in the transmembrane region 
of the protein, preferably in the S2 domain. This alpha-helical structure is thought to associate 
with at least two additional spike proteins to form a trimer. Helical or coiled regions of the spike 
protein are identified below. Predicted coiled-coils of SEQ ID NO: 6042 (spike protein) are at 
5 amino acids 900-1005 and 1151-1185 (see Figure 12). 

Accordingly, the invention comprises a polypeptide sequence comprising a fragment of 
SEQ ID NO: 6042 wherein said fragment includes a coiled region of SEQ ID NO: 6042. Said 
fragment preferably includes the amino acid sequences selected from the group consisting of 
amino acid positions 900 to 1005 and amino acid positions 1151 to 1185 of SEQ ID NO: 6042. 

10 The invention comprises a polypeptide sequence comprising a fragment of SEQ ID NO: 6042, 
wherein said fragment does not include a coiled region of SEQ ID NO: 6042. Said fragment 
preferably includes the amino acid sequences selected from the group consisting of amino acid 
positions 900 to 1005 and amino acid positions 1 151 and 1 1 85 of SEQ ID NO: 6042. 
The spike protein is believed to play an integral role in fusion and infection of 

15 Coronaviruses with mammalian host cells. Analysis of coronavirus spike proteins as well as 
similar surface proteins in other viruses has identified at least two structural motifs, typically 
located within the S2 domain, associated with this fusion event: heptad repeats (HR) and 
membrane fusion peptides. 

At least two 4,3 hydrophobic heptad repeat (HR) domains are typically found in the 

20 ectodomain of the S2 domain of Coronaviruses. One heptad repeat region (HR1) is typically 
located adjacent to a fusion peptide while a second heptad region (HR2) is typically located near 
the C-terminus of the S2 domain, close to the transmembrane anchor. Heptad repeats are 
characteristic of coiled-coil structures and the c heptad repeats found in viral surface proteins (such 
as coronavirus spike protein) are thought to form bundled helix structures which are involved in 

25 viral entry. See Bosch et al, J. Virology (2003) 77:8801-881 1 (Figure IB of this reference 
illustrates an alignment of the HR1 and HR2 regions of five coronaviruses along with SARS, 
annotated "HCov-SARS"). 

Heptad repeats generally contain a repeating structure of seven amino acids, designated a- 
b-c-d-e-f-g, where hydrophobic sidechains of residues a and d typically form an apolar stripe, 

30 and electrostatic interactions are found in residues e and g. Position a is most frequently Leu, lie 
or Ala and position d is usually Leu or Ala. Residues e and g are often Glu or Gin, with Arg and 
Lys also prominent at position g. Charged residues are common to positions b, c and /as these 
residues may be in contact with solvent Exceptions to these general parameters are known. For 
instance Pro residues are sometimes found within the heptad. 

35 The HR1 and HR2 sequences of an MHV strain have been postulated to assemble into a 

thermostable, oligomeric, alphahelical rold-like complex, with the HR1 and HR2 helices 
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oriented in an antiparallel manner. IcL In this same study, HR2 was asserted to be a strong 
inhitibor of both virus entry into the cell and cell-cell fusion. 

HR1 and HR2 sequences have been identified in the SARS virus genome. The SARS virus 
HR1 region comprises approximately amino acids 879 to 1005 of SEQ ID NO: 6042 or 
5 fragments thereof capable of forming at least one alpha-helical turn. Preferably, said fragments 
comprise at least 7 (e.g., at least 14, 21, 28, 35, 42, 49 or 56) amino acid residues. SEQ ID NO: 
7192, includes amino acids 879 to 1005 of SEQ ID NO: 6042. 

A preferred fragment of HR1 comprises amino acid residues 879 to 980 of SEQ ID NO: 
6042. This preferred fragment is SEQ ID NO: 7193. 
10 Another preferred fragment of HR1 comprises amino acid residues 901 to 1005 of SEQ 

ID NO: 6042. This preferred fragment is SEQ ID NO: 7194. 

The SARS virus HR2 region comprises approximately amino acids 1 144 to 1201 of SEQ 
ID NO: 6042, or fragments thereof capable of forming at least one alpha-helical turn. Preferably, 
said fragments comprise at least 7 (e.g. y at least 14, 21, 28, 35, 42, 49 or 56) amino acid residues. 
15 SEQ ID NO: 7195 includes amino acids 1 144 to 1201. A preferred fragment of HR2 comprises 
amino acids 1 144 to 1 195 of SEQ ID NO: 6042. This preferred fragment is SEQ ID NO: 7196. 

Membrane Fusion peptides sequences within the spike protein are also believed to 
participate in fusion (and infection) of the virus with a host cell. Fusion peptides generally 
comprise about 16 to 26 amino acid residues which are conserved within viral families. These 
20 Membrane Fusion peptides are relatively hydrophobic and generally show an asymmetric 

distribution of hydrophobitiy when modeled into an alpha helix. They are also generally rich in 
alanine and glycine. 

At least three hydrophobic Membrane Fusion peptide regions have been identified within 
coronaviruses (PEP1, PEP2, and PEP3). See, Luo et al, "Roles in Cell-Cell Fusion of Two 
25 Conserved Hydrophobic Regions in the Murine Coronavirus Spike Protein**, Virology (1998) 
244:483-494. Figure 1 of this paper shows an alignment of Membrane Fusion peptide sequences 
of Mouse Hepatitis Viris, Bovine Corona Virus, Feline Infectious Peritonitis Virus, 
Transmissible Gastroenteritis Virus and Infectious Bronchitis Virus. See also, Bosch et aL, "The 
Coronavirus Spike Protein is a Class I Virus Fusion Protein: Structural and Functional 
30 Characterization of the Fusion Core Complex** Journal of Virology (2003) 77(16):8801-881 1. 
PEP1 (SEQ ID NO: 7197), PEP2 (SEQ ID NO: 7198) and PEP3 (SEQ ID NO: 7199) 
sequences within the SARS spike protein have been identified. 

The coronavirus spike proteins (and other similar surface viral proteins) are thought to 
undergo a conformational change upon receptor binding tp the target cell membrane. One or 
35 more of the hydrophobic Membrane Fusion peptides are thought to become exposed and inserted 
into the target membrane as a result of this conformational change. The free energy released 
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upon subsequent refolding of the spike protein to its most stable conformation is believed to play 

a role in the merger of the viral and cellular membranes. 

One or more SARS HR sequences, preferably HR2, or a fragment thereof may be used to 

inhibit viral entry and membrane fusion with a target mammalian host cell. The invention 
5 provides a method of inhibiting viral infection comprising administering a composition 

comprising one or more SARS HR polypeptides or a fragment thereof. Preferably, the 

composition comprises a SARS HR2 sequence. 

In another embodiment, the invention includes a composition comprising a SARS HR1 

sequence, or a fragment thereof and a SARS HR2 sequence, or a fragment thereof. Hie HR1 and 
10 HR2 sequences may optionally be associated together in an oligomer. Hie composition may 

comprise the intermediate domain sequence between the HR1 and HR2 domains. The use of 

such an intermediate sequence may facilitate oligomerization or other structural interaction 

between the HR regions. 

HR sequences for use in the invention may be produced iecombinantly by methods known 
15 . in the art The SARS HR sequences may be modified to facilitate bacterial expression. In 

particular, the HR sequences may be modified to facilitate transport of the recombinant protein 

to the surface of the bacterial host cell. For example, leader sequences to a bacterial membrane 

protein may be added to the N terminus of the recombinant HR sequences. HR sequences for use 

in the invention may alternatively be produced by chemical synthesis by methods known in the 
20 art (see below). 

As discussed in more detail later in the specification, Applicants have identified structural 
similarities between the SARS spike protein and the surface protein of Neisseria meningitidis, 
NadA (and other similar bacterial adhesion proteins). Another means of facilitating bacterial 
expression of HR sequences includes the addition of the stalk and/or anchor sequences of a 
25 NadA-like protein to the C-terminus of the recombinant HR sequences. Recombinant sequences 
containing the bacterial anchor sequence may preferably be prepared in outer membrane vesicles 
(the preparation of which is discussed in more detail later in the application). Recombinant 
sequences missing the bacterial anchor sequences may be secreted and isolated from the 
supernatant 

30 The invention includes a polypeptide sequence comprising a first sequence and a second 

sequence, wherein said first sequence comprises a leader sequence for a bacterial membrane 
protein and wherein said second sequence comprises a HR sequence of a coronavirus. 
Preferably, said first sequence comprises the leader sequence for a bacterial adhesin protein. 
More preferably, said bacterial adhesion protein is NadA. Preferably said second sequence 

35 comprises HR1, HR2 or both. In one embodiment, the second sequence comprises HR1, HR2 
and the intermediate domain sequence present in the naturally occrding spike protein. For 
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example, the second sequence may comprise a fragment of a coronavirus spike protein 
comprising the amino acids starting with the N-terminus of the HR1 region and ending with the 
C-terminus of the HR2 region. 

Hie invention further includes a polypeptide sequence comprising a first, second, third and 
5 fourth sequence, wherein the first sequence comprises a leader sequence for a bacterial 

membrane protein; wherein said second sequence comprises a HR sequence of a coronavirus; 
wherein said third sequence comprises a stalk domain of a bacterial adhesion protein; and 
wherein said fourth sequence comprises an anchor domain of a bacterial adhesion protein. In 
one embodiment, the first sequence comprising the leader peptide sequence is removed. In 
10 another embodiment, the third sequence comprising the stalk domain is removed. In another 
embodiment, die fourth sequence comprising the anchor domain is removed. 

The polypeptide sequences of the above described constructs may be linked together by 
means known in the art, including, for example, via glycine linkers. 

' Examples of constructs which may be used, in such bacterial expression systems are shown 
15 in FIGURE 50. Polypeptide sequences of each of the constructs illustrated in FIGURE 50 are 
given as SEQ ID NOS: 7200 to 7206. 

7200 Leader NadA (1-29) -HRl (879-980) - 6Xgly - HR2 (1 144-1 195) - stalk+anchor NadA (88-405) 

7201 Leader NadA (1-29) -HRl (879-980) - 6Xgly-HR2 (1144-1196) - stalk NadA (88-351) 

7202 Leader NadA (1-29) - HRl - HR2 (879-1196) - stalk+anchor NadA (88-405) 

7203 Leader NadA (1-29) - HRl - HR2 (879-1196)-stalk NadA (88-351) 

7204 HRl : HR2 (879-1 196)-stalk NadA (88-35 l>6xhis 

7205 Leader NadA (1-29) - HRl - HR2 (879-1 196)-anchor NadA (351-405) 

7206 Leader NadA (1-29)- HRl -HR2 (879-1 196) 

Administration of one of more of these Membrane Fusion sequences may also interfere 
with the ability of a coronavirus to fuse to a host cell membrane. Accordingly, the invbntion 

£0 includes an isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 7197, SEQ ID NO: 7198and SEQ ID NO: 7199. The invention 
further includes an isolated polypeptide comprising an amino acid sequence having sequence 
homology to an amino acid sequence selected from the groujrconsisting of SEQ ID NO: 7197, 
SEQ ID NO: 7198and SEQ ID NO: 7199. 

15 Two or more of these SARS Membrane Fusion peptides can be combined together. The 

invention includes a composition comprising two SARS Membrane Fusion peptides wherein said 
peptides are selected from at least two of the amino acids selected from the group consisting of 
SEQ ID NO: 7197, SEQ ID NO: 7198and SEQ ID NO: 7199, or a sequence having sequence 
identity thereto. 

10 Two or more of the SARS Membrane Fusion peptides may be linked together. 

Accordingly, the invention includes a polypeptide comprising a first amino acid sequence and a 
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second amino acid sequence, wherein said first and second amino acid sequences are selected 
from the group consisting of SEQ ID NO: 7197, SEQ ID NO: 7198and SEQ ID NO: 7199, or a 
sequence having sequence identity thereto. Preferably, said first amino acid sequence and said 
second amino acid sequence are different SARS Membrane Fusion peptides, Le., they are not the 
5 same. 

Hie invention also includes a method of treating or preventing SARS virus infection 
comprising administering one or more of the SARS Membrane Fusion peptide compositions 
described above. 

As discussed above, the spike protein is capable of forming trimers. Hie invention further 
10 includes a polypeptide comprising SEQ ID NO: 6042 in trimeric form. The invention includes a 
composition comprising at least polypeptides wherein each polypeptide comprises at least the 
alpha-helical coiled region of a SARS vims spike protein. Preferably, the spike protein 
comprises SEQ ID NO: 6042 or a fragment thereof. 

The invention further includes a composition comprising a SARS virus spike protein or a * 
15 fragment thereof wherein said protein is associated with a transmembrane and wherein said 
fragment comprises the alpha-helical region of the SARS virus spike protein. Preferably, the 
composition comprises at least three SARS virus spike proteins or a fragment thereof, wherein 
the fragment comprises the alpha-helical region of the SARS virus spike protein. 

Hie invention further includes an antibody which specifically binds to a trimeric form of 
20 SARS virus spike proteins. Preferably, the spike protein comprises SEQ ID NO: 6042 or a 

fragment thereof. The invention includes an antibody which specifically binds to a trimeric form 
of SARS virus spike proteins wherein said proteins are associated with a transmembrane. 

The invention further includes an antibody which specifically binds to a monomelic form 
of SARS virus spike protein or a fragment thereof. Preferably, the antibody specifically binds to 
25 a monomeric form of SEQ ID NO: 6042 or a fragment thereof. 

The invention further includes a small molecule which interferes with or disrupts the 
coiling of a SARS viral spike protein trimer. 

The invention further includes an attenuated SARS virus for use as a vaccine wherein said 
attenuated virus contains a polynucleotide insertion, deletion or substitution which does not 
JO disrupt the trimeric conformation of the SARS virus spike protein. The invention further 

includes an attenuated SARS virus for use as a vaccine wherein said attenuated virus contains a 
polynucleotide insertion, deletion or substitution which does not disrupt the alpha-helical 
formation of the SARS virus spike protein. 

The spike protein may be recombinantly produced. In one embodiment, the spike protein 
15 is expressed in virus like particles so that the protein is attached to a cell membrane. Such 

attachment may facilitate presentation of immunogenic epitopes of the spike protein. Preferably, 
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the alpha-helical portion of the spike protein is associated with the cell membrane. Preferably, 
the spike proteins form a trimer within the transmembrane region of attachment 
Predicted N-glycosylation sites of SEQ ID NO: 6042 are identified below: 



Position 






Potential 


Jury 


NGlyc 












agreement result 


29 NYTQ 


SEQ ID NO: 


7207 


0 


.7751 


(9/9) 


+++ 


65 NVTG 
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+++ 
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0 
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(7/9) 
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++ 
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(7/9) 


+ 


1116 NNTV 


SEQ ID NO: 


7222 


0 


.5106 


(5/9) 


+ 


1176 NESL 


SEQ ID NO: 


7223 


0 


.6796 


(9/9) 


++ 



Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6042 wherein 3aid fragment comprises one or more of the glycosylation sites identified above 
(SEQ ID NOS: 7207-7223). The invention further includes a polynucleotide encoding one or 
more of the fragments identified above. This glycosylation site can be covalently attached to a 
saccharide. Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID 
NO: 6042 wherein said fragment comprises one or more of the glycosylation sites identified 
above and wherein said polypeptide is glycosylated at one or more of the sites identified above. 

Predicted O-glycosylation sites are identified below: 



Residue No. Potential Threshold Assignment 



Thr 


698 


0.8922 


0.7696 


T 


Thr 


706 


0.9598 


0.7870 


T 


Thr 


922 


0.9141 


0.7338 


T 


Ser 


36 


0.8906 


0.7264 


S 


Ser 


703 


0.8412 


0.7676 


S 



The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6042 wherein 
said fragment comprises one or more of the O-glycosylation sites identified above. Hie 
invention further includes a polynucleotide encoding one or more of the fragments identified 
above. The invention further includes a polypeptide comprising a fragment of SEQ ID NO: 
6042 wherein said fragment comprises one or more of the O-glycosylation sites identified above 
and further wherein the polypeptide is covalently bonded to a saccharide at one or more of the 
included glycosylation sites. 

Hie invention further includes a polypeptide comprising a fragment of SEQ ID NO: 6042 
wherein said fragment comprises one or more of the N-glycosylation sites identified above and 
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further wherein said fragment comprises one or more of the O-glycosylation sites identified 
above. 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6042 wherein 
said fragment does not include one or more of the glycosylation sites identified above. Hie 
5 invention also includes a polynucleotide encoding such a polypeptide. 

Predicted phosphorylation sites of SEQ ID NO: 6042 are Ser-346, Tyr-195, and Tyr-723. 
Accordingly, the invention comprises a polypeptide comprising a fragment of SEQ ID NO: 6042 
wherein said fragment comprises at least ten amino acid residues and wherein said fragment 
comprises one or more of the amino acids selected from the group consisting of Ser-346, Tyr- 
10 195, and Tyr-723. In one embodiment, one or more of the amino acids selected from the group 
consisting of Ser-346, Tyr-195, and Tyr-723 are phosphorylated. 

Expression and functional characterization of the spike glycoprotein has been described by 
XiaoetaL (2003) Biochem Biophys Res CoTnm 312:1159-1164. 

T-epitopes for SEQ ID NO: 6042 are identified in Table 16. The invention includes a 
15 polypeptide for use as an antigen, wherein die polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified as SEQ ID 
NOS: 8041-8280; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
20 of such polynucleotides through viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 8041-8280, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
25 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

30 The invention provides a method of raising an immune response in a mammal (typically a 

human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

Hie invention includes a polypeptide comprising SEQ ID NO: 6040. The invention 
35 includes a polypeptide comprising an amino acid sequence having sequence identity to SEQ ID 
NO: 6040. The invention includes a fragment of a polypeptide comprising SEQ ID NO: 6040. 
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Hie invention includes a polynucleotide encoding SEQ ID NO: 6040 or a fragment thereof. Hie 
invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 6040 or a 
fragment thereof. Hie invention includes a diagnostic kit comprising a polynucleotide sequence 
encoding SEQ ID NO: 6040 or a fragment thereof. The invention includes an antibody which 
5 recognizes a polypeptide comprising SEQ ID NO: 6040 or a fragment thereof. 

SEQ ID NO: 6040 demonstrates functional homology with a membrane protein of 
coronaviruses. Predicted transmembrane helices of SEQ ID NO: 6040 are identified below: 
Predicted Transmembrane Helices 

10 The sequence positions in brackets denominate the core region. 
Only scores above 500 are considered significant. 

Inside to outside helices : 3 found 
from to score center 

15 27 ( 30) 48 ( 45) 1138 38 

137 ( 139) 153 ( 153) 486 146 

Outside to inside helices : 3 found 
from to score center 

20 28 ( 31) 45 ( 45) 819 38 

71 ( 73) 90 ( 90) 210 81 
136 { 142) 156 ( 156) 272 149 

Hie amino acid region with the highest predicted transmembrane helical region is from 

25 amino acid position 27 to 48 of SEQ ID NO: 6040. Such transmembrane regions are often 
difficult to express recombinantly. Accordingly, the invention includes a polypeptide 
comprising a fragment of SEQ ID NO: 6040 wherein said fragment does not include the amino 
acid sequence between positions 27 to 48. The invention includes a polypeptide comprising a 
fragment of SEQ ID NO: 6040 wherein said fragment does not include the amino acid sequence 

30 between positions 28 to 45. Hie invention also includes a polynucleotide sequence encoding any 
of the above-identified polypeptides. 

SEQ ID NO: 6040 is predicted to be a hypothetical protein of the SARS virus. A 
prediction of the protein localization of SEQ ID NO: 6040 is set forth below. SEQ ID NO: 6040 
is predicted to be located in one of the following locations: mitochondrial matrix space, 

35 microbody (peroxisome), nucleus, and mitochondrial inner membrane. SEQ ID NO: 6040 is 
predicted to be associated with an organelle inside an infected cell. 

Accordingly, SEQ ID NO: 6040 is a target for screening of chemical inhibitors to the 
SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6040 or a fragment 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 

40 NO: 6040 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 
6040 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6040 
in a host cell. The invention includes a small molecule which prevents the polypeptide of SEQ 
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ID NO: 6040 from associating with an organelle inside of an infected cell. Hie invention 
includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 6040. 

PSORT Prediction of Protein Localization Sites 

version 6.4 (WWW) 

5 SEQ ID NO: 6040 163 Residues 

Species classification: 4 

*** Reasoning Step: 1 

10 Preliminary Calculation of ALOM t threshold: 0.5) 

count : 0 

McG: Examining signal sequence (McGeoch) 
Length of UR: 9 
Peak Value of UR: 1.75 
15 Net Charge of CR: 1 

Discriminant Score: -2.56 
GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): 1.94 
Possible cleavage site: 53 
20 »> Seems to have no N-terminal signal seq. 

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in ALOM? : 0B 
25 ALOM: finding transmembrane regions (Klein et al . ) 

count: 0 value: 1.32 threshold: -2.0 
PERIPHERAL Likelihood =1.32 
modified ALOM score: -1.16 
Gavel: Examining the boundary of mitochondrial targeting seq. 
30 motif at: 156 

HRSVTI 

Discrimination of mitochondrial target seq. : 

notclr ( 0.88) 
Rule: mitochondrial protein 
35 Rule: mitochondrial protein 

Rule: mitochondrial protein 
Rule: mitochondrial protein 



*** Reasoning Step: 2 

40 

KDEL Count: 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 156) from: 27 to: 44 Score: 5.0 
Mitochondrial matrix? Score: 0.36 
45 SKL motif (signal for peroxisomal protein) : 

pos: 99(163), count: 1 SKL 
SKL score (peroxisome): 0.3 
Amino Acid Composition Tendency for Peroxisome: -4.28 
Peroxisomal proteins? Status: notclr 
50 Amino Acid Composition tendency for lysosomal proteins 

score: 0.02 Status: notclr 
Modified score for lysosome: 0.152 
Checking the amount of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
55 Found: pos: 132 (5) KRKR 

Checking the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 
Checking the RNA binding motif (nucleus or cytoplasm) 
nuc modified. Score: 0.60 
60 Nuclear Signal Status: notclr { 0.30) 
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Checking CaaX motif. . 
Checking N-rayristoylation . . 
Checking CaaX motif . . 

5 Final Results 

mitochondrial matrix space Certainty^ 0.480 (Affirmative) < suco 

microbody (peroxisome) Certainty^ 0.300 (Affirmative) < suco 

nucleus Certainty^ 0.300 (Affirmative) < suco 

mitochondrial inner membrane Certain ty= 0.188 (Affirmative) < suco 

10 

Predicted N-glycosylation sites of SEQ ID NO: 6040 are identified below. 

Position Potential Jury NGlyc 

agreement result 

15 2 NKTG (SEQ ID NO: 7255) 0.7804 (9/9) +++ 

106 NLTL (SEQ ID NO: 7256) 0.6123 (7/9) + 

Accordingly, the invention comprises a fragment of SEQ ID NO: 6040 wherein said 
fragment is at least ten amino acids and wherein said fragment comprises one or more of the 

20 asparagines from the amino acid positions of SEQ ID NO: 6040 selected from the group 

consisting of 2 and 106. The invention includes a fragment of SEQ ID NO: 6040 wherein said 
fragment comprises one or more amino acid sequences selected from the group consisting of 
SEQ ID NO: 7255 and SEQ ID NO: 7256. Preferably, the fragment comprises the amino acid 
sequence NKTG (SEQ ID NO: 7255). 

25 The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6040 wherein 

said fragment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6040 are identified in Table 14. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

30 sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 7640-7800; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 

35 comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ED NOS: 7640-7800, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 

40 CTL response. The use preferably protects or treats disease and/or infection caused by a S ARS 
virus. 
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Hie invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
5 human), comprising the step of administering to die mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. Hie immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6041. SEQ ID NO: 6041 
demonstrates functional homology with a portion of an ORF lab polyprotein. The invention 

10 includes a polypeptide comprising an amino acid sequence having sequence identity to SEQ ID 
NO: 6041. Hie invention includes a fragment of a polypeptide comprising SEQ ID NO: 6041. 
The invention includes a polynucleotide sequence encoding an amino acid sequence having 
sequence identity to SEQ ID NO: 6041. The invention includes a polynucleotide encoding a 
fragment of a polypeptide comprising SEQ ID NO: 6041. 

15 Hie invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 

6041 or a fragment therof . The invention includes a diagnostic kit comprising a polynucleotide 
encoding SEQ ID NO: 6041 or a fragment thereof. The invention includes an immunogenic 
composition comprising a polypeptide comprising SEQ ID NO: 6041 or a fragment thereof. The 
invention includes an antibody which recognizes a polypeptide comprising SEQ ID NO: 6041 or 

20 a fragment thereof. 

The polyproteins of coronaviruses are associated with enzymatic activity. Accordingly, 
SEQ ID NO: 6041 is a target for screening of chemical inhibitors to the SARS virus. The 
invention includes a polypeptide comprising SEQ ID NO: 6041 or a fragment thereof. The 
invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID NO: 6041 or 

25 a fragment thereof. Hie invention includes a method of screening SEQ ID NO: 6041 for an 

inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6041 in a host cell. . 
The invention includes a small molecule which prevents the polypeptide of SEQ ID NO: 6041 
from performing enzymative activity. The invention includes a fusion protein wherein said 
fusion protein comprises SEQ ID NO: 6041. 

30 Predicted transmembrane or hydrophobic regions of SEQ ID NO: 6041 are identified 

below. Although the polyprotein of coronaviruses is proteolytically cleaved into numerous 
smaller proteins, hydrophobic domains in the polyprotein are known to mediate the membrane 
association of the replication complex and to be able to dramatically alter the architecture of host 
cell membranes. Accordingly, die hydrophobic domains of the polyprotein are targets for 

35 genetic mutation to develop attenuated SARS virus vaccines. The hydrophobic domains are also 
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targets for small molecule inhibitors of the SARS virus- Hie hydrophobic domains may also be 
used to generate antibodies specific to those legions to treat or prevent SARS virus infection. 

Possible transmembrane helices of SEQ ZD NO: 6041 

The sequence positions in brackets denominate the core region. 
Only scores above 500 are considered significant. 

Inside to outside helices : 18 found 





from 


to 


score 


center 


234 


( 234) 254 


( 250) 


1046 


241 


256 


( 256) 272 


( 270) 


252 


263 


319 


( 319) 334 


( 334) 


227 


327 


503 


( 505) 522 


( 519) 


405 


512 


613 


( 615) 633 


( 629) 


619 


622 


677 


( 679) 703 


( 696) 


467 


689 


849 


( 851) 869 


( 865) 


229 


858 


1080 


(1080)1097 


(1094) 


306 


1087 


1147 


(1149)1163 


(1163) 


354 


1156 


1557 


(1557)1581 


(1577) 


817 


1567 


1954 


(1954)1971 


(1971) 


832 


1964 


2369 


(2372)2395 


(2387) 


300 


2379 


2513 


(2513)2532 


(2529) 


690 


2522 


Outside to inside helices : 14 fount 




from 


to 


score 


center 


239 


{ 239) 254 


( 254) 


924 


247 


239 


( 248)' 272 


( 263) 


468 


256 


311 


( 314) 334 


( 328) 


267 


321 


499 


{ 503) 522 


( 519) 


485 


512 


617 


( 617) 634 


( 631) 


425 


624 


849 


( 853) 872 


( 872) 


572 


864 


1147 


(1147)1162 


(1162) 


765 


1155 


1564 


(1564)1581 


(1579) 


883 


1572 


1951 


(1951)1968 


(1966) 


657 


1958 


2513 


(2522)2539 


(2537) 


711 


2529 



10 



15 



20 



25 



30 



35 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6041, wherein said fragment comprises an amino acid sequence including one or more of the 
hydrophobic transmembrane sequences identified above. The invention includes a polypeptide 
comprising a fragment of SEQ ID NO: 6041 wherein said fragment comprises one or more of the 

40 following polypeptide sequences of SEQ ID NO: 6041: 234-254, 613-633, 1557-1581, 1954- 
1971, 2513-2532, 239-254, 1564-1581, 1951-1968, 2513-2539. Preferably, the fragment 
comprises one or more of the following polypeptide sequences of SEQ ID NO: 6041: 234-254 
and 239-254. The invention also includes polynucleotides encoding each of the polypeptide 
fragments identified above. 

45 The invention includes an attenuated SARS virus wherein said attenuated SARS virus 

contains an addition, deletion or substitution in the polynucleotides encoding for one of the 
hydrophobic domains identified above. Hie invention also includes a method for creating an 
attenuated SARS virus comprising mutating a SARS virus by adding, deleting or substituting the 
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viral genome of the S ARS virus to alter the coding of one or more of the hydrophobic domains 
of SEQ ID NO: 6041 identified above. 

The invention includes an antibody which specifically identifies one or more of the 
hydrophobic regions of SEQ ID NO: 6041 identified above. Hie invention includes a small 
5 molecule which binds to, interferes with the hydrophobicity of or otherwise disrupts one or more 
of the hydrophobic regions of SEQ ID NO: 6041 identified above. 

Predicted N-glycosylation sites of SEQ ID NO: 6041 are identified below: 



10 



Position 








Potential 


Jury 


NGlyc 














agreement result 


571 


NLSH 


(SEQ 


ID NO: 


7257) 


0.6598 


(8/9) 


+ 


835 


NTSR 


(SEQ 


ID NO: 


7258) 


0.5762 


(7/9) 


+ 


958 


NVTD 


(SEQ 


ID NO: 


7259) 


0.7494 


(9/9) 


++ 


1113 


NISD 


(SEQ 


ID NO: 


7260) 


0.7259 


(8/9) 


+ 


1205 


NSTL 


(SEQ 


ID NO: 


7261) 


0.6296 


(9/9) 


++ 


1460 


NVTG 


(SEQ 


ID NO: 


7262) 


0.6844 


(9/9) 


++ 


1685 


NHSV 


(SEQ 


ID ,NO: 


7263) 


0.5181 


(5/9) 


+ 


2029 


NKTT 


(SEQ 


ID NO: 


7264) 


0.5423 


(5/9) 


+ 



Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 

20 acid sequence of SEQ ID NO: 6041; wherein said fragment comprises one or more of the N- 
glycosylation sites identified above. The invention comprises a polypeptide comprising a 
fragment of SEQ ID NO: 6041 wherein said fragment comprises one or more of sequences SEQ 
ID NOS: 7257-7264. Preferably, the fragment comprises one or more of the sequences SEQ ID 
NOS: 7257, 7259, 7260, 7261 and 7262. The invention further comprises a polynucleotide 

25 encoding one or more of the polypeptides identified above. 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6041 wherein 
said fragment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6041 are identified in Table 15. Hie invention includes a 

30 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 7801-8040; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). Hie invention further comprising a method of expression or delivery 

35 of such polynucleotides through viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 7801-8040, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 

40 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
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CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. 

The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
5 is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 
10 Hie invention includes a polypeptide sequence SEQ ID NO: 6043 or a fragment thereof. 

The invention includes a polypeptide comprising an amino acid sequence having sequence 
identity to SEQ ID NO: 6043. Hie invention includes a polynucleotide sequence encoding the 
amino acid sequence of SEQ ID NO: 6043 or a fragment thereof. 

Predicted transmembrane regions of SEQ ID NO: 6043 are set forth below. 

15 Inside to outside helices : 4 found 

from to score center 

41 ( 41) 56 ( 56) 1789 49 

76 ( 79) 99 ( 99) 2142 89 

105 ( 105) 125 ( 125) 1250 115 

20 

Outside to inside helices : • 3 found 

from to score center 

41 ( 41) 59 ( 56) 2053 49 

76 ( 82) 98 ( 96) 1580 89 

25 103 ( 105) 125 { 123) 1257 115 

The amino acid region with the highest predicted transmembrane helical region is from 
amino acid position 76 to 99 of SEQ ID NO: 6043. Such transmembrane regions are often 
difficult to express recombinantly. Accordingly, the invention includes a polypeptide 
30 comprising a fragment of SEQ ID NO: 6043 wherein said fragment does not include one or more 
of the hydrophobic amino acid sequences identified above. Preferably, the fragment does not 
include the amino acids between positions 27 to 48. The invention also includes a 
polynucleotide sequence encoding any of the above-identified polypeptides. 

SEQ ID NO: 6043 is predicted to be a hypothetical protein of the SARS virus. A 
35 prediction of the protein localization of SEQ ID NO: 6043 is set forth below. SEQ ID NO: 6043 
is predicted to be located in one of the following locations: mitochondrial inner membrane, 
plasma membrane, Golgi body, and mitochondrial intermembrane space. SEQ ID NO: 6043 
may be associated with an organelle inside an infected cell. 

Accordingly, SEQ ID NO: 6043 is a target for screening of chemical inhibitors to the 
40 SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6043 or a fragment 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 

-45- 



WO 2004/092360 



PCT/US2004/011710 



NO: 6043 or a fragment thereof. Hie invention includes a method of screening SEQ ID NO: 
6043 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6043 
in a host cell. The invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6043 from associating with an organelle inside of an infected cell. Hie invention 
5 includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 6043. 

PSORT Prediction of Protein Localization Sites for SEQ ID NO: 6043 

version 6. 4 (WWW) 

Species classification: 4 

10 *** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0.5) 
count : 3 

Position of the most N- terminal TMS: 40 at i=2 
15 MTOP: membrane topology (Hartmann et al.) 

I (middle): 47 Charge diffirence (C-N) : 3.5 
McG: E xamin ing signal sequence (McGeoch) 
Length of UR: 12 
Peak Value of UR: 1.41 
20 Net Charge of CR: 0 

Discriminant Score:* -4.67 
GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): 3.44 
Possible cleavage site: 15 
25 »> Seems to have no N- terminal signal seq. 

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM new cnt: 2 ** thrshld changed to -2 
Cleavable signal was detected in ALOM? : 0B 
30 ALOM: finding transmembrane regions (Klein et al.) 

count: 2 value: -6.90 threshold: -2.0 

INTEGRAL Likelihood = -6.90 Transmembrane 83 - 99 ( 78 - 101) 
INTEGRAL Likelihood = -5.04 Transmembrane 40 - 56 ( 37 - 60) 
PERIPHERAL Likelihood = -0.32 ( 
35 modified ALOM score: 1.48 

»> Likely a Type Illb membrane protein (Nexo Ccyt) 
Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 128 
MRCWLC 

40 Discrimination of mitochondrial target seq. : 

notclr ( 0.76) 

Rule: mitochondrial protein 

Rule: mitochondrial protein 

Rule: mitochondrial protein 
45 Rule: mitochondrial protein 

*** Reasoning Step: 2 

Type Ilia or Illb is favored for ER memb. proteins 
50 KDEL Count: 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 128) from: 39 to: 56 Score: 11.5 
»> Seems to have an intramitochondrial signal 
Mitochondrial inner membrane? Score: 0.59 
55 Mitochondrial intermemb. space? Score: 0.22 

SKL motif (signal for peroxisomal protein) : 
pos: 92(274), count: 1 SHL 
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SKL score (peroxisome); 0.3 
Amino Acid Composition Tendency for Peroxisome: 4.78 
Peroxisomal proteins? Status: positive 
Amino Acid Composition tendency for lysosomal proteins 
5 score: 1.16 Status: notclr 

Type HI proteins may be localized at Golgi 
Checking the amount of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
Checking the 7 residue pattern for Nuclear Targeting 
10 Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 
Nuclear Signal Status: negative ( 0.00) 
Check the Number of TMSs for typelll (plasma memb.) 
Checking N-myristoylation . . 

15 

Final Results 

mitochondrial inner membrane — - Certainty^ 0.664 (Affirmative) < suco 

plasma membrane Certainty** 0.600 (Affirmative) < suco 

Golgi body — - Certainty^ 0.400 (Affirmative) < suco 
20 mitochondrial intexmambrane space Certainty^ 0*362 (Affirmative) < suco 

Predicted N- and O- glycosylation sites of SEQ ID NO: 6043 are identified below. 

Position Potential Jury NGlyc 

agreement result 
25 227 NATF (SEQ ID NO: 7265) 0.6328 (7/9) + 

Residue No. Potential Threshold Assignment 



30 



35 



Thr 


28 


0.9095 


0.6280 


T 


Thr 


32 


0.8740 


0.6595 


T 


Thr 


34 


0.9058 


0.6655 


T 


Thr 


170 


0.6816 


0.6600 


T 


Thr 


267 


0.9240 


0.5779 


T 


Thr 


268 


0.7313 


0.5708 


T 


Thr 


269 


0.9859 


0.5583 


T 


Thr 


270 


0.8023 


0.5492 


T 


Ser 


27 


0.6930 


0.6091 


S 


Ser 


252 


0.6457 


0.5977 


S 



Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
40 acid sequence of SEQ ID NO: 6043, wherein said fragment comprises the N-glycosylation sites 
or O-glycosylation sites identified above. The invention comprises a polypeptide comprising a 
fragment of SEQ ID NO: 6043 wherein said fragment comprises one or more of the N- 
glycosylation sites or O-glycosylation sites identified above. The invention further comprises a 
polynucleotide encoding one or more of the polypeptides identified above. 
45 The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6043 wherein 

said fragment does not include one or more of the glycosylation sites identified above. The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6043 are identified in Table 17. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
50 sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 8281-8486; (b) an amino acid sequence having sequence identity to an amino acid 
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sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
5 ID NOS: 8281-8486, or a polynucleotide encoding such a polypeptide. 

Hie use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein ie.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
10 virus. Hie invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
IS wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. Hie immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6044. The invention 
includes a polypeptide comprising a fragment of SEQ ID NO: 6044 or a sequence having 
sequence identity to SEQ ID NO:206. The invention includes a polynucleotide encoding SEQ 
20 ID NO: 6044. 

SEQ ID NO: 6044 is identified as a hypothetical protein. Predicted hydrophobic or 
transmembrane regions of SEQ ID NO: 6044 are identified below: 

Inside to outside helices : 3 found 
from to score center 

25 1 ( 1) 17 ( 15) 891 8 

47 { 47) 66 ( 63) 221 56 

Outside to inside helices : 4 found 
from to score center 

30 1 ( 4) 21 < 19) 599 11 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6044 wherein said fragment does not include one or more of the hydrophobic amino acid 
sequences identified above. Preferably, the fragment does not include the amino acids between 
35 positions 1 to 19. The invention also includes a polynucleotide sequence encoding any of the 
above-identified polypeptides. 

SEQ ID NO: 6044 is predicted to be a hypothetical protein of SARS virus. A prediction of 
the protein localization of SEQ ID NO: 6044 is set forth below. SEQ ID NO: 6044 is predicted 
to be located in one of the following locations: nucleus, mitochondrial matrix, lysosome (lumen), 
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and microbody (peroxisome). SEQ ID NO: 6044 may be associated with an organelle inside an 
infected cell. 

Accordingly, SEQ ID NO: 6044 is a target for screening of chemical inhibitors to the 
SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6044 or a fragment 

5 thereof . Hie invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6044 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 
6044 for an inhibitor. Hie invention includes the recombinant expression of SEQ ID NO: 6044 
in a host cell. Hie invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6044 from associating with an organelle inside of an infected cell. Hie invention 

10 includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 6044. 

PSORT Prediction of Protein Localization Sites for SEQ ID NO: 6044 

version 6. 4 (WWW) 

154 Residues 

Species classification: 4 

15 

*** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0.5) 
count : 0 

20 McG: Examining signal sequence (McGeoch) 

Length of X7R: 7 
Peak Value of UR: 1.06 
Net Charge of CR: 1 
Discriminant Score: -7.97 
25 GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): -3.28 
Possible cleavage site: 34 
»> Seems to have no N- terminal signal seq. 
Amino Acid Composition of Predicted Mature Form: 
30 calculated from 1 

ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in AIiOM?: OB 
ALOM: finding transmembrane regions (Klein et al.) 
count: 0 value: 1.43 threshold: -2.0 
35 PERIPHERAL Likelihood = 1.43 

modified ALOM score: -1.19 
Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 151 
FRKKQV 

40 Discrimination of mitochondrial target seq. : 

notclr (-0.46) 

*** Reasoning Step: 2 

45 KDEL Count: 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 151) from: 46 to: 50 Score: 5.0 

Mitochondrial matrix? Score: 0.36 

SKL motif (signal for peroxisomal protein) : 
50 pos: -1(154), count: 0 

Amino Acid Composition Tendency for Peroxisome: 0.61 

Peroxisomal proteins? Status: notclr 
AAC score (peroxisome) : 0.149 

Amino Acid Composition tendency for lysosomal proteins 
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score: 0.81 Status: notclr 

Modified score for lysosome: 0.231 

Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
5 Found: pos: 134 (3) KHKK 

Checking the 7 residue pattern for Nuclear Targeting 

Checking the Robbins & Dingwall consensus (nucleus) 
Found: pos: 136 (3) KK VSTNLCTHSF RKKQV 

Final Robbins Score (nucleus): 0.60 
10 Checking the RNA binding motif (nucleus or cytoplasm) 

nuc modified. Score: 0.90 

Nuclear Signal Status: positive ( 0.70) 

Checking CaaX motif. . 

Checking N-myristoylation. . 
15 Checking CaaX motif . . 

Final Results 

nucleus Certainty^ 0 .880 (Affirmative) < suco 

mitochondrial matrix space — - Certainty^ 0.360 (Affirmative) < suco 

20 lysosome (lumen) Certainty^ 0.231 (Affirmative) < suco 

microbody (peroxisome) Certainty^ 0.149 (Affirmative) < suco 

One predicted O-glycosylation site of SEQ ID NO: 6044 is identified at residue 4: 

Residue No. Potential Threshold Assignment 
25 Thr 4 0.6839 (K6484 T 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6044, wherein said fragment comprises the O-glycosylation site 
identified above. The invention further comprises a polynucleotide encoding one or more of the 

30 polypeptides identified above. 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6044 wherein 
said fragment does not include one or more of the glycosylation sites identified above: The 
invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6044 are identified in Table 18. The invention includes a 

35 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 8487*8665; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 

40 of such polynucleotides through viral vectors and/or viral particles. Hie invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 8487-8665, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 

45 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
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vims. Hie invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
5 human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide sequence comprising SEQ ID NO: 6045. The 
invention includes a polypeptide sequence comprising an amino acid sequence having sequence 
10 identity to SEQ ID NO: 6045. Hie invention includes a polypeptide sequence comprising a 

fragment of SEQ ID NO: 6045. Hie invention includes a polynucleotide sequence encoding any 
of these polypeptides. 

SEQ ID NO: 6045 demonstrates functional homology with the envelope or small 
membrane protein of coronaviruses. The invention includes a diagnostic kit comprising a 
15 polypeptide comprising SEQ ID NO: 6045 or a fragment thereof. The invention includes a 
diagnostic kit comprising a polynucleotide encoding SEQ ID NO: 6045 or a fragment thereof. 
The invention includes an immunogenic composition comprising SEQ ID NO: 6045 or a 
fragment thereof. The invention includes an antibody which recognizes a polypeptide 
comprising SEQ ID NO: 6045 or a fragment thereof. 
20 Predicted transmembrane regions of SEQ ID NO: 6045 are identified below: 

Inside to outside helices : 1 found 
from to score center 

17 ( 19) 33 ( 33) 2881 26 ■ 

25 Outside to inside helices : 1 found 

from to score center 

17 ( 17) 34 ( 34) 2981 27 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
30 6045 wherein said fragment does not include one or more of the hydrophobic amino acid 

sequences identified above. Preferably, the fragment does not include the amino acids between 
positions 17 to 34. The invention also. includes a polynucleotide sequence encoding any of the 
above-identified polypeptides. In one embodiment, the invention includes a polypeptide 
comprising a fragment of SEQ ID NO: 6045 wherein said fragment does not include amino acid 
35 residues 1-34 of SEQ ID NO: 6045. 

Predicted protein Localization Site of SEQ ID NO: 6045 is below. 

PSORT Prediction of Protein realization Sites for SEQ ID NO: 6045 

version 6.4 (WWW) 

Species classification: 4 

40 

*** Reasoning Step: 1 
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Preliminary Calculation of ALOM (threshold: 0.5) 
count: 2 

Position of the most N-terxninal IMS: 17 at i=l 
5 MTOP: membrane topology {Hartmann et al.) 

I (middle): 24 Charge dif firence(C-N) : 2.0 
McG: Examining signal sequence (McGeoch) 
Length of OR: 29 
Peak Value of DR: 3.40 
10 Net Charge of CR: -2 

Discriminant Score: 13.07 
GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): 4.37 
Possible cleavage site: 32 

15 ... positive value of mtop 

»> Seems to have an uncleavable N-term signal seq. 
Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
ALOM new cnt: 1 ** thrshld changed to -2 
20 Cleavable signal was detected in ALOM? : 0B 

ALOM: finding transmembrane regions (Klein et al.) 
count: 1 value: -15.12 threshold: -2.0 

INTEGRAL Likelihood =-15.12 Transmembrane 17 - 33 ( .8 - 44) 
PERIPHERAL Likelihood =0.47 
25 modified ALOM score: 3.12 

»> Seems to be a Type lb (Nexo Ccyt) membrane protein 

The cytoplasmic tail is from 34 to 76 (44 Residues) 
Rule: vesicular pathway 
Rule: vesicular pathway 
10 Rule: vesicular pathway 

( 6) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 6 
Uncleavable? Ipos set to: 16 
*5 Discrimination of mitochondrial target seq. : 
notclr (0.19) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

\0 

*** Reasoning Step: 2 

> Relative position of the end of the tail: 44% 

Memb. protein with uncleavable signl is often at ER 
\5 KDEL Count: 0 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 16) from: 70 to: 99 Score: 21.5 

>» Seems to have an intramitochondrial signal 

SKL motif (signal for peroxisomal protein) : 
»0 pos: -1(76), count: 0 

Amino Acid Composition Tendency for Peroxisome: -4.11 

Peroxisomal proteins? Status: negative 

Amino Acid Composition tendency for lysosomal proteins 
score: 0.68 Status: notclr 
•5 Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 

Checking the 7 residue pattern for Nuclear Targeting 

Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 
0 Nuclear Signal Status: negative (0.00) 

Check cytoplasmic tail for typelb (plasma memb.) 
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10 



Checking the NPXY motif. . 
Checking the YXRF motif . . 
Checking N-myristoylation. . 

Final Results 

plasma membrane Certainty^ 0 -730 (Affirmative) < suco 

endoplasmic reticulum (membrane) Certainty 0 0.640 (Affirmative) < suco 

endoplasmic reticulum (lumen) Certainty* 0.100 (Affirmative) < suco 

outside Certainty 0.100 (Affirmative) < suco 

Predicted N-glycosylation sites of SEQ ID NO: 6045 are identified at residues 48 and 66: 

Position Potential Jury NGlyc 

agreement result 
48 NVSL 0.6514 (9/9) ++ (SEQ ID NO: 7266) 

15 66 NSSE 0.5880 (7/9) + (SEQ ID NO: 7267) 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6045, wherein said fragment comprises one or more of the 
N-glycosylation sites identified above. The invention comprises a polypeptide comprising a 

20 fragment of SEQ ID NO: 6045 wherein said fragment comprises one or more of the N- 

glycosylation sites identified above. Hie invention further comprises a polynucleotide encoding 
one or more of the polypeptides identified above. 

The invention includes a polypeptide comprising a fragment of SEQ ID NO: 6045 wherein 
said fragment does not include one or more of the glycosylation sites identified above. The 

15 invention also includes a polynucleotide encoding such a polypeptide. 

T-epitopes for SEQ ID NO: 6045 are identified in Table 19. The invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 8666-8820; (b) an amino acid sequence having sequence identity to an amino acid 

$0 sequence of (a). The invention further comprising a polynucleotide sequence encoding the 

polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 8666-8820, or a polynucleotide encoding such a polypeptide. 

15 The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 

complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 

■0 immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 
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The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 
5 The invention includes a polypeptide sequence comprising SEQ ID NO: 6046. The 

invention includes polypeptide sequences comprising an amino acid sequence having sequence 
identity to SEQ ID NO: 6046. The invention includes a polypeptide sequence comprising a 
fragment of SEQ ID NO: 6046. The invention includes a polynucleotide encoding one of these 
polypeptides. 

10 SEQ ID NO: 6046 has functional homology with a matrix protein of a coronavirus. The 

invention includes a diagnostic kit comprising a polypeptide comprising SEQ ID NO: 6046 or a 
fragment thereof. The invention includes a diagnostic kit comprising a polynucleotide encoding 
SEQ ID NO: 6046 or a fragment thereof. The invention includes an immunogenic composition 
comprising SEQ ID NO: 6046 or a fragment thereof. Hie invention includes an antibody which 

15 recognizes a polypeptide comprising SEQ ID NO: 6046 or a fragment thereof. 

Predicted transmembrane regions of SEQ ID NO: 6046 are identified below. 

Inside to outside helices : 3 found 
from to score center 

21 ( 21) 38 ( 36) 2412 29 
20 51 ( 53) 69 ( 69) 2645 60 

74 ( 82) 96 ( 96) 2464 89 

Outside to inside helices : 3 found 
from to score center 

25 18 ( 21) 38 ( 38) 2363 28 

52 ( 52) 67 ( 67) 2363 60 
76 ( 76) 95 ( 92) 2605 84 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
30 6046 wherein said fragment does not include one or more of the hydrophobic amino acid 

sequences identified above. Preferably, the fragment does not include the amino acids between 
positions selected from the group consisting of 18 to 38, 52 to 67 and 76 to 95. The invention 
also includes a polynucleotide sequence encoding any of the above-identified polypeptides. 
Predicted protein localization of SEQ ID NO: 6046 is set forth below. 

35 PSORT Prediction of Protein Localization Sites 

version 6.4 (WWW) 

Species classification: 4 
*** Reasoning Step: 1 

40 

Preliminary Calculation of ALOM (threshold: 0.5) 
count : 3 

Position of the most N- terminal TMS: 21 at i=l 
MTOP: membrane topology (Hartmann et al.) 
45 I (middle): 28 Charge dif firence(C-N) : 6.0 
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McG: Examining signal sequence (McGeoch) 
Length of UR: 1 
Peak Value of UR: 3.16 
Net Charge of CR: -3 
5 Discriminant Score: 2.21 

GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): 4.29 
Possible cleavage site: 39 
. . . positive value of mtop . . . 
10 >» Seems to have an uncleavable N-tenn signal seq. 

Amino Acid Composition of Predicted Mature Form: 

calculated from 1 
Cleavable signal was detected in ALOM? : 0B 
ALOM: finding transmembrane regions (Klein et al.) 
15 count: 3 value: -7.64 threshold: 0.5 

INTEGRAL Likelihood = -7.64 Transmembrane 21 - 37 ( 18 - 39) 
INTEGRAL Likelihood = -7.59 Transmembrane 50 - 66 ( 43 - 72) 
INTEGRAL Likelihood = -5.04 Transmembrane 79 - 95 ( 72 - 99) 
PERIPHERAL Likelihood = 2.38 
20 modified ALOM score: 2.13 

»> Likely a Type Illb membrane protein (Nexo Ccyt) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 
25 ( . 2 ) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting seq. 
' motif at: 2 

Uncleavable? Ipos set to: 12 
Discrimination of mitochondrial target seq. : 
30 negative (-4.16) 

Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

35 *** Reasoning Step: 2 

Type Ilia or Illb is favored for ER memb. proteins 
Memb. protein with uncleavable signl is often at ER 
KDEL Count : 0 

40 Checking apolar signal for intramitochondrial sorting 

SKL motif (signal for peroxisomal protein) : 
pos: -1(221), count: 0 

Amino Acid Composition Tendency for Peroxisome: 5.01 

Peroxisomal proteins? Status: notclr 
45 Amino Acid Composition tendency for lysosomal proteins 
score: 2.30 Status: positive 

Type III proteins may be localized at Golgi 

Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
50 Checking the 7 residue pattern for Nuclear Targeting 

Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 

Nuclear Signal Status: negative ( 0.00) 

Check the Number of TMSs for typelll (plasma memb.) 
55 Checking N-myristoylation. . 

Final Results 

endoplasmic reticulum (membrane) Certainty^ 0.68 5 (Affirmative) < suco 

plasma membrane Certainty= 0.640 (Affirmative) < suco 

60 Golgi body Certainty^ 0*460 (Affirmative) < suco 

endoplasmic reticulum (lumen) Certainty** 0 .100 (Affirmative) < suco 
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One predicted N-glycosylation sites of SEQ ID NO: 6046 is identified at residue 4: 

Predi rtinn n fN-zbtcosvlation sites 

Position Potential Jury NGlyc 
5 agreement result 

4 NGTI 0.8430 (9/9) +++ (SEQ ID NO: 7268) 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6046, wherein said fragment comprises the N-glycosylation site 

10 identified above. The invention further comprises a polynucleotide encoding one or more of the 
polypeptides identified above. 

The invention further comprises a polypeptide comprising a fragment of amino acid 
sequence SEQ ID NO: 6046, wherein said fragment does not include the N-glycosylation site 
identified above. The invention includes a polynucleotide encoding such a fragment 

15 A variant of SEQ ID NO: 6046 that is included within the invention is SEQ ID NO: 9963. 

Compared to SEQ ID NO: 6046, this sequence has Val at residue 72 instead of Ala. 

T-epitopes for SEQ ID NO: 6046 are identified in Table 20. Hie invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 

20 NOS: 8821-9018; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 

25 ID NOS: 8821-9018, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 

30 virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
35 wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 
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The invention includes a polypeptide sequence comprising SEQ ID NO: 6047 or a 
fragment thereof or an amino acid sequence having sequence identity thereto. Predicted 
transmembrane regions of SEQ ID NO: 6047 are identified below. 

Inside to outside helices : 2 found 

5 from to score center 

7 { 10) 29 ( 27) 729 17 

21 { 24) 41 ( 41) 640 34 

Outside to inside helices : 2 found 
10 from to score center 

4 ( 4) 22 ( 19) 874 12 

22 ( 24) 41 ( 41) 499 31 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 

15 6047 wherein said fragment does not include one or more of the hydrophobic amino acid 

sequences identified above. Preferably, the fragment does not include the amino acids between 
positions selected from the group consisting of 4 to 22 and 22 to 41. The invention also includes 
a polynucleotide sequence encoding any of the above-identified polypeptides. 

. SEQ ED NO: 6047 is predicted to be a hypothetical protein of SARS virus. A prediction of 

20 the protein localization of SEQ ID NO: 6047 is set forth below. SEQ ID NO: 6047 is predicted 
to be located in one of the following locations: plasma membrane, endoplasmic reticulum, Golgi 
body, and microbody (peroxisome). SEQ ID NO: 6047 may be associated with an organelle 
inside an infected cell or with viral entry to a host cell. 

. Accoidingly, SEQ ID NO: 6047 is a target for screening of chemical inhibitors to the 

25 SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6047 or a fragment 
thereof. Hie invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6047 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 
6047 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6047 
in a host cell. The invention includes a small molecule which prevents the polypeptide of SEQ 

30 ID NO: 6047 from associating with an organelle inside of an infected cell or interacting with a 
host cell membrane. The invention includes a fusion protein wherein said fusion protein 
comprises SEQ ID NO: 6047. Predicted protein localization of SEQ ID NO: 6047 is set forth 
below. 

PSORT Prediction of Protein Localization Sites 

35 version 6. 4 (WWW) 

Species classification: 4 

*** Reasoning Step: 1 

40 Preliminary Calculation of ALOM (threshold: 0.5) 

count : 1 

Position of the most N- terminal TMS: 2 at i=l 
MT0P: membrane topology (Hartmann et al.) 

I (middle): 9 Charge dif f irence(C-N) : 0.5 
45 McG: Examining signal sequence (McGeoch) 
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Length of DR: 6 

Peak Value of DR: 3.08 

Net Charge of CR: 0 

Discriminant Score: 5.12 
5 GvH: Examining signal secfuence (von Heijne) 

Signal Score (-3.5): -4.45 

Possible cleavage site: 34 
»> Seems to have an uncleavable N-term signal seq. 
Amino Acid Composition of Predicted Mature Form: 
10 calculated from 1 

ALOM new cnt: 1 ** thrshld changed to -2 

Cleavable signal was detected in ALOM? : 0B 

ALOM: finding transmembrane regions (Klein et al.) 

count: 1 value: -2.44 threshold: -2.0 
15 INTEGRAL Likelihood = -2.44 Transmembrane 2 - 18 ( 1 - 20) 

PERIPHERAL Likelihood = 1.22 

modified ALOM score: 0.59 
»> Seems to be a Type II (Ncyt Cexo) membrane protein 

The cytoplasmic tail is from 1 to 1 (1 Residues) 
20 Rule: vesicular pathway 

Rule: vesicular pathway 
Rule: vesicular pathway 
( 5) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting seg. 
25 motif at : 5 

Uncleavable? Ipos set to: 15 
Discrimination of mitochondrial target seq. : 

notclr (1.48) 
Rule: vesicular pathway 
30 Rule: vesicular pathway 

Rule: vesicular pathway 

*** Reasoning Step: 2 

35 Relative position of the cytoplasmic tail: 1% 

Larger value (>30%) is favared for ER memb. proteins 
Memb. protein with uncleavable signl is often at ER 
KDEL Count : 0 

Checking apolar signal for intramitochondrial sorting 
W (Gavel position 15) from: 64 to: 93 Score: 30.0 

»> Seems to have an intramitochondrial signal 

SKL motif (signal for peroxisomal protein) : 
pos: -1(63), count: 0 

Amino Acid Composition Tendency for Peroxisome: 1.91 
15 Peroxisomal proteins? Status: notclr 

AAC score (peroxisome): 0.161 

Amino Acid Composition tendency for lysosomal proteins 
score: 0.04 Status: notclr 

Checking the consensus for Golgi 
)0 Checking the consensus for Golgi 

Checking the cytoplasmic tail of type II. (Golgi) 

Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 

Checking the 7 residue pattern for Nuclear Targeting 
>5 Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 

Nuclear Signal Status: negative ( 0.00) 

Check mitochondrial signal for typell (plasma memb. ) 

Type II is favored for plasma memb. proteins 
0 Checking the NPXY motif.. 

Checking the YXRP motif .. . 
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Checking N-myristoylation. . 
Pinal Results 

plasma membrane - — Certainty* 0.685 (Affirmative) < suco 

5 endopl asmi c reticulum (membrane) certain ty= 0.640 (Affirmative) < suco 

Golgi body Certainty^ 0.370 (Affirmative) < suco 

microbody (peroxisome) Certaintya 0.161 (Affirmative) < suco 

T-epitopes for SEQ ID NO: 6047 ate identified in Table 21. The invention includes a 
10 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9019-9131; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
15 of such polynucleotides through viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9019-9131, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
20 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a - 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

25 The invention provides a method of raising an immune response in a mammal (typically a 

human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6048, a fragment thereof or 

30 an amino acid sequence having sequence identity thereto. Predicted transmembrane regions of 
SEQ ID NO: 6048 are identified below. 

Inside to outside helices : 2 found 
from to score center 

3 ( 3) 18 ( 18) 1857 10 
35 100 ( 100) 117 { 115) 2904 107 

Outside to inside helices : 2 found 

from to score center 

1 ( 1) 15 ( 15) 1299 8 

40 100 ( 100) 117 { 115) 3009 107 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6048 wherein said fragment does not include one or more of the hydrophobic amino acid 
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sequences identified above. Preferably, the fragment does not include the amino acids between 
positions selected from the group consisting of 1 to 15 and 100 to 117. The invention also 
includes a polynucleotide sequence encoding any of the above-identified polypeptides. 

SEQ ID NO: 6048 is predicted to be a hypothetical protein of SARS vims. A prediction of 
5 the protein localization of SEQ ID NO: 6048 is set forth below. SEQ ID NO: 6048 is predicted 
to be located in one of the following locations: plasma membrane, Iysosome (membrane), 
microbody (peroxisome), and endoplasmic reticulum (membrane). SEQ ID NO: 6048 may be 
associated with an organelle inside an infected cell or may interact with a host cell plasma 
membrane during viral entry to the host cell. 

10 Accordingly, SEQ ID NO: 6048 is a target for screening of chemical inhibitors to the 

SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6048 or a fragment . 
thereof. The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6048 or a fragment thereof. Hie invention includes a method of screening SEQ ID NO: 
6048 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6048 

15 in a host cell. Hie invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6048 from associating with an organelle inside of an infected cell or prevents the 
polypeptide from associating with the cell membrane of a host cell. The invention includes a 
fusion protein wherein said fusion protein comprises SEQ ID NO: 6048. Predicted protein 
localization of SEQ ID NO: 6048 is set forth below. 

10 PSORT Prediction of Protein Localization Sites 

version 6. 4 (WWW) 

Species classification: 4 
*** Reasoning Step: 1 

»5 

Preliminary Calculation of ALOM (threshold: 0.5) 
count : 2 

Position of the most N-terminal TMS: 3 at i=2 
MTOP: membrane topology (Hartmann et al.) 
0 I (middle): 10 Charge dif f irence(C-N) : -2.5 

McG: Examining signal sequence (McGeoch) 
Length of UR: 13 
Peak Value of UR: 3.38 
Net Charge of CR: 1 
5 Discriminant Score: 10.02 

GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): 2.56 
Possible cleavage site: 15 
»> Seems to have a cleavable N-term signal seq. 
J Amino Acid Composition of Predicted Mature Form: 
calculated from 16 
ALOM new cnt: 2 ** thrshld changed to -2 
Cleavable signal was detected in ALOM? : IB 
ALOM: finding transmembrane regions (Klein et al.) 
5 count: 1 value: -14.75 threshold: -2.0 

INTEGRAL Likelihood =-14.75 Transmembrane 101 - 117 ( 95 - 120) 
PERIPHERAL Likelihood = 6.63 
modified ALOM score: 3.05 
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»> Seems to be a Type la membrane protein 

The cytoplasmic tail is from 118 to 122 (5 Residues) 
Rule: vesicular pathway 
Rule: vesicular pathway 
5 Rule: vesicular pathway 
(15) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 15 
Uncleavable? Ipos set to: 25 
10 Discrimination of mitochondrial target seq. : 

notclr (0.73) 
Rule: vesicular pathway 
Rule: vesicular pathway 
Rule: vesicular pathway 

15 

*** Reasoning Step: 2 
KDKL Count: 0 

Checking apolar signal for intramitochondrial sorting 
20 (Gavel position 25) from: 3 to: 12 Score: 8.5 

SKL motif (signal for peroxisomal protein) : 
pos: -1(122), count: 0 

Amino Acid Composition Tendency for Peroxisome: 2.46 
AAC not from the N-term. , score modified 
25 Peroxisomal proteins? Status: notclr 

AAC score (peroxisome): 0.115 

Amino Acid Composition tendency for lysosomal proteins 
score: -0.40 Status: negative 

GY motif in the tail of typela? (lysosomal) 
JO Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 

Checking the 7 residue pattern for Nuclear Targeting 

Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 
$5 Nuclear Signal Status: negative (0.00) 

Type la is favored for plasma memb. proteins 

Checking the NPXY motif.. 

Checking the YXRF motif.. 

Checking N-myristoylation. . 
K) Checking GPI anchor.. 

»> Seems to be GPI-anchored (0.85) 

Final Results 

plasma membrane Certainty^ 0 .919 (Affirmative) < suco 

\5 lysosctme (membrane) Certainty^ 0.200 (Affirmative) < suco 

microbody (peroKisome) — Certainty* 0.115 (Affirmative) < suco 
endoplasmic reticulum (membrane) Certainty* 0.100 (Affirmative) < suco 

T-epitopes for SEQ ID NO: 6048 are identified in Table 22. The invention includes a 
0 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9132-9308; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
5 of such polynucleotides through viral vectors and/or viral particles. The invention further 
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comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9132-9308, or a polynucleotide encoding such a polypeptide. 

Hie use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
5 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. Hie use preferably protects or treats disease and/or infection caused by a SARS 
virus. Hie invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

10 Hie invention provides a method of raising an immune response in a mammal (typically a 

human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. Hie immune response is preferably protective or therapeutic. 

The invention includes a polypeptide comprising SEQ ID NO: 6049, a fragment thereof or 

15 an amino acid sequence having sequence identity thereto. Predicted transmembrane or 
hydrophobic regions of SEQ ID NO: 6049 are identified below. 

Inside to outside helices : 1 found 
from to score center 

13 ( 13) 30 ( 28) 3532 20 

20 

Outside to inside helices : 1 found 
from to score center 

9 ( 11) 29 ( 26) 3395 19 

25 Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 

6049 wherein said fragment does not include one or more of the hydrophobic amino acid 
sequences identified above. The invention also includes a polynucleotide sequence encoding any 
of the above-identified polypeptides. 

SEQ ID NO: 6049 is predicted to be a hypothetical protein of SARS virus. A prediction of 

30 the protein localization of SEQ ID NO: 6049 is set forth below. SEQ ID NO: 6049 is predicted 
to be located in one of the following locations: outside, microbody (peroxisome), endoplasmic 
reticulum (membrane) and endoplasmic reticulum Oumen). The highest ranking indicates that 
SEQ ID NO: 6049 is located on the outside of a cell. Accordingly, SEQ ID NO: 6049 may be a 
surface exposed protein. 

J5 Accordingly, SEQ ID NO: 6049 may be used in an immunogenic composition to raise an 

immune response against the SARS virus. It also may be used to generate antibodies specific to 
the SARS virus. Such antibodies may be used in a method of treatment or prevention of a SARS 
virus infection. Such antibodies may further be used in a diagnostic test to identify the presence 
or absence of SARS virus in a biological sample. 
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The invention includes a polypeptide comprises SEQ ID NO: 6049 or a fragment thereof. 
The invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID NO: 
6049 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 6049 for 
an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6049 in a host 
5 cell. The invention includes a fusion protein wherein said fusion protein comprises SEQ ID NO: 
6049. Predicted protein localization of SEQ ID NO: 6049 is set forth below. 

PSORT Prediction of Protein Localization Sites 

version 6.4 (WWW) 

Species classification: 4 

10 

*** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0.5) 
count : 1 

15 Position of the most N-terminal TMS: 11 at i=l 

MTOP: membrane topology (Hartmann et al.) 

I (middle): 18 Charge dif f irence (C-N) : -2.0 
McG: Examining signal sequence (McGeoch) 
Length of UR: 24 
20 Peak Value of UR: 3.69 

Net Charge of CR: -2 
Discriminant Score: 13.56 
GvH: Examining signal sequence (von Heijne) 
Signal Score (-3.5): 0.52 
15 Possible cleavage site: 25 

»> Seems to have a cleavable N-term signal seq. 
Amino Acid Composition of Predicted Mature Form: 

calculated from 26 
ALOM new cnt: 1 ** thrshld changed to -2 
JO Cleavable signal was detected in ALOM? : IB 

ALOM: finding transmembrane regions (Klein et al.) 
count: 0 value: 14.80 threshold: -2.0 
PERIPHERAL Likelihood =14.80 
' modified ALOM score: -3.86 
15 Rule: vesicular pathway 

Rule: vesicular pathway 
Rule: vesicular pathway 
( 2) or uncleavable? 

Gavel: Examining the boundary of mitochondrial targeting seq. 
\0 motif at: 2 

Uncleavable? Ipos set to: 12 
Discrimination of mitochondrial target seq. : 

notclr (1.42) 
Rule: vesicular pathway 
■5 Rule: vesicular pathway 
Rule: vesicular pathway 

*** Reasoning Step: 2 

0 KDEL Count: 0 

Number of Potential N-glycosylation Sites: 0 
Out: score 0.800 

Checking apolar signal for intramitochondrial sorting 
(Gavel position 12) from: 44 to: 73 Score: 30.0 
5 »> Seems to have an intramitochondrial sig na l 

SKL motif (signal for peroxisomal protein) : 
pos: -1(44), count: 0 
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Amino Acid Composition Tendency for Peroxisome: 9.47 

AAC not from the N-term. , score modified 
Peroxisomal proteins? Status: notclr 

AAC score (peroxisome): 0.320 
5 Amino Acid Composition tendency for lysosomal proteins 

score: -6.47 Status: negative 
Number of NX(S/T) motif: 0 

Checking the amount of Basic Residues (nucleus) 

Checking the 4 residue pattern for Nuclear Targeting 
10 Checking the 7 residue pattern for Nuclear Targeting 

Checking the Robbins & Dingwall consensus (nucleus) 

Checking the RNA binding motif (nucleus or cytoplasm) 

Nuclear Signal Status: negative ( 0.00) 

Checking CaaX motif . . 
15 Checking N-myristoylation. . 

Checking CaaX motif.. 

Pinal Results 

outside Certainty^ 0.820 (Affirmative) < suco 

20 microbody (peroxisome) Certainty^ 0.320 (Affirmative) < suco 

endoplasmic reticulum (membrane) Certainty^ 0.100 (Affirmative) < suco 

endoplasmic reticulum (lumen) Certainty? 0.100 (Affirmative) < suco 

T-epitopes for SEQ ID NO: 6049 are identified in Table 23. The invention includes a 
25 polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 

sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9309-9437; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 
polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
30 of such polynucleotides through viral vectors and/or viral particles. The invention further 

comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS : 9309-9437, or a polynucleotide encoding such a polypeptide. 

The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
35 as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

40 The invention provides a method of raising an immune response in a mammal (typically a 

human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. The immune response is preferably protective or therapeutic. 
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Hie invention includes a polypeptide comprising SEQ ID NO: 6050 or a fragment thereof 
or an amino acid sequence having sequence identity thereto. Predicted transmembrane or 
hydrophobic regions are identified below. 

Inside to outside helices : 1 found 
5 from to score center 

13 ( 15) 32 ( 30) 558 23 

Outside to inside helices : 1 found 
from to score center 

10 16 ( 16) 30 ( 30) 364 23 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
6050 wherein said fragment does not include one or more of the hydrophobic amino acid 
sequences identified above. Hie invention also includes a polynucleotide sequence encoding any 

15 of the above-identified polypeptides. 

SEQ ID NO: 6050 is predicted to be a hypothetical protein of SARS virus. A prediction of 
the protein localization of SEQ ID NO: 6050 is set forth below. SEQ ID NO: 6050 is predicted 
to be located in one of the following locations: lysosome (lumen), mitochondrial matrix space, 
mitochondrial inner membrane, and mitochondrial intennembrane space. SEQ ID NO: 6050 

20 may be associated with an organelle inside an infected cell during the viral replication cycle. 

Accordingly, SEQ ID NO: 6050 is a target for screening of chemical inhibitors to the 
SARS virus. The invention includes a polypeptide comprises SEQ ID NO: 6050 or a fragment 
thereof. Hie invention includes a polynucleotide encoding the polypeptide sequence of SEQ ID 
NO: 6050 or a fragment thereof. The invention includes a method of screening SEQ ID NO: 

25 6050 for an inhibitor. The invention includes the recombinant expression of SEQ ID NO: 6050 
in a host cell. The invention includes a small molecule which prevents the polypeptide of SEQ 
ID NO: 6050 from associating with an organelle inside of an infected cell or prevents the 
polypeptide from associating with the cell membrane of a host cell. Hie invention includes a 
fusion protein wherein said fusion protein comprises SEQ ID NO: 6050. Predicted protein 

10 localization of SEQ ID NO: 6050 is set forth below. 

PSORT Prediction of Protein Localization Sites 

version 6.4 (WWW) 

MYSEQ 84 Residues 

Species classification: 4 

15 

*** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0.5) 
count : 0 

0 McG: Examining signal sequence (McGeoch) 

Length of UR: 3 

Peak Value of UR: 1.46 

Net Charge of CR: 2 

Discriminant Score: -5.73 
5 GvH: Examining signal sequence (von Heijne) 
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Signal Score (-3.5): -0.12 
Possible cleavage site: 29 
»> Seems to have no N-terminal signal seq. 
Amino Acid Composition of Predicted Mature Form: 
5 calculated from 1 

ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: 0B 
ALOM: finding transmembrane regions (Klein et al.) 
count: 0 value: 8.43 threshold: -2.0 
10 PERIPHERAL Likelihood = 8.43 

modified ALOM score: -2.59 
Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 61 
ARCWYL 

15 Discrimination of mitochondrial target seq. : 

positive ( 1.66) 

Rule: mitochondrial protein 

Rule: mitochondrial protein 

Rule: mitochondrial protein 
20 Rule: mitochondrial protein 

*** Reasoning Step: 2 

KDEL Count: 0 

25 Checking apolar signal for intramitochondrial sorting 

(Gavel position 61) from: 52 to: 58 Score: 6.0 
Mitochondrial matrix? Score: 0.38 
* SKL motif (signal for peroxisomal protein) : 
pos: -1(84), count: 0 
30 Amino Acid Composition Tendency for Peroxisome: 1.47 
Peroxisomal proteins? Status: notclr 

AAC score (peroxisome): 0.263 
Amino Acid Composition tendency for lysosomal proteins 
score T 2.86 Status: positive 
35 Modified score for lysosome: 0.850 

Checking the amount of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
Checking, the 7 residue pattern for Nuclear Targeting 
Checking the Robbins & Dingwall consensus (nucleus) 
40 Checking the RNA binding motif (nucleus or cytoplasm) 
Nuclear Signal Status: negative ( 0.00) 
Checking CaaX motif. . 
Checking N-myristoylation. . 
Checking CaaX motif . . 

45 

Pinal Results 

lysosome (lumen) Certainty^ 0.850 (Affirmative) < suco 

mitochondrial matrix space Certainty^ 0 . 544 (Affirmative) < suco 

mitochondrial inner membrane Certainty^ 0.266 (Affirmative) < suco 

50 mitochondrial intermembrane space Certainty* 0.266 (Affirmative) < suco 

One predicted N-glycosylation sites of SEQ ID NO: 6050 is identified at residue 43: 

Position Potential Jury NGlyc 

agreement result 
55 43 NVTI 0.6713 (9/9) ++ (SEQ ID NO: 7269) 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6050 wherein said fragment comprises the N-glycosylation site 
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identified above. The invention further comprises a polynucleotide encoding one or more of the 
polypeptides identified above. 

Hie invention further comprises a polypeptide comprising a fragment of amino acid 
sequence SEQ ID NO: 6050 wherein said fragment does not include the N-glycosylation site 
5 identified above. Hie invention includes a polynucleotide encoding such a fragment 

T-epitopes for SEQ ID NO: 6050 are identified in Table 24. Hie invention includes a 
polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9438-9538; (b) an amino acid sequence having sequence identity to an amino acid 

10 sequence of (a). Hie invention further comprising a polynucleotide sequence encoding the 

polypeptides of (a) or (b). Hie invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9438-9538, or a polynucleotide encoding such a polypeptide. * 

15 The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 

complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
virus. The invention provides the use of a polypeptide in the manufacture of a medicament for 

20 immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 
wherein said immune response is a cell-mediated immune response and, preferably, a CTL 

15 response. The immune response is preferably protective or therapeutic. 

The invention includes a polypeptide sequence comprising SEQ ID NO: 6051 or a 
fragment thereof or an amino acid sequence having sequence identity thereto. The invention 
includes a polypeptide sequence comprising SEQ ID NO: 6052 or a fragment thereof or an 
amino acid sequence having sequence identity thereto. 

50 SEQ ID NO: 6051 and SEQ ID NO: 6052 demonstrate functional homology with a 

nucleocapsid protein of a coronavirus. The invention includes a diagnostic kit comprising a 
polypeptide comprising SEQ ID NO: 6051, SEQ ID NO: 6052 or a fragment thereof. The 
invention includes a diagnostic kit comprising a polynucleotide encoding SEQ ID NO: 6051, 
SEQ ID NO: 6052 or a fragment thereof. The invention includes an immunogenic composition 

55 comprising SEQ ID NO: 6051, SEQ ID NO: 6052 or a fragment thereof. The invention includes 
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an antibody which recognizes a polypeptide comprising SEQ ID NO: 6051, SEQ ID NO: 6052 
or a fragment thereof. 

SEQ ID NO: 6051 is predicted to be phosphorylated at Ser-79; Thr-92; Ser-106; Tlir-116; 
Hir-142; Ser-184; Ser-188; Ser-202; Ser-236; TTir-248; Ser-251; Ser-256; Thr-377. 

5 Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 6051 
wherein said fragment includes one or more of the amino acid residues of SEQ ID NO: 6051 
selected from the group consisting of Ser-79; Thr-92; Ser-106; Thr-116; Thr-142; Ser-184; Ser- 
188; Ser-202; Ser-236; Thr-248; Ser-251; Ser-256; Thr-377. The invention further includes a 
polypeptide comprising a fragment of SEQ ID NO: 6051 wherein said fragment does not include 

10 one or more of the amino acid residues of SEQ ID NO: 605 1 selected from the group consisting 
of Ser-79; Thr-92; Ser-106; Thr-116; Thr-142; Ser-184; Ser-188; Ser-202; Ser-236; Thr-248; 
Ser-251; Ser-256; Thr-377. Two further useful fragments of the N protein (e.g. for 
immunoassay) are SEQ ID NOS: 9783 & 9784, which are lysine-rich and can be used to 
distinguish the SARS virus from other coronaviruses. 

15 Predicted transmembrane regions of SEQ ID NO: 6051 are identified below. 

Inside to. outside helices 1 found 
from to score center 

304 ( 304) 323 ( 319) 495 312 

20 Outside to inside helices : 2 found 

from to score center 

304 { 304) 319 ( 319) 597 312 * 

Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID NO: 
25 605 1 wherein said fragment does not include one or more of the hydrophobic amino acid 

sequences identified above. The invention also includes a polynucleotide sequence encoding any 
of the above-identified polypeptides. 

Predicted protein localization of SEQ ED NO: 6051 is set forth below. SEQ ID NO: 6051 
is predicted to be localized near the nucleus, lysosome (lumen), mitochondrial matrix space, and 
30 microbody (peroxisome). The highest ranking is for localization near the nucleus. Coronavirus 
nucleocapsid proteins are known to bind to viral RNA. Coronavirus nucleocapsid proteins are 
also thought to be important for cell mediated immunity. Accordingly, the invention includes a 
polynucleotide comprising SEQ ID NO: 6051. Hie invention further includes a viral vector or 
particle suitable for in vivo delivery of the polynucleotide sequence comprising a SARS virus 
35 nuceocapsid polynucleotide sequence or a fragment thereof. In one embodiment, the 

polynucleotide comprises SEQ ID NO: 6051 or a fragment thereof. The invention further 
includes a method for eliciting a cell mediated immune response comprising delivering a 
polynucleotide encoding a SARS virus nucleocapsid protein or a fragment thereof to a mammal. 
In one embodiment, the polynucleotide comprising SEQ ID NO: 6051 or a fragment thereof. 
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Hie invention further includes a method of screening SEQ ID NO: 6051 for an inhibitor. 
The invention includes the recombinant expression of SEQ ID NO: 6051 in a host cell. The 
invention includes a small molecule which prevents the polypeptide of SEQ ID NO: 6051 from 
binding to SARS virus RNA during viral replication. The invention includes a fusion protein 
5 wherein said fusion protein comprises SEQ ID NO: 6051. Predicted protein localization of SEQ 
ID NO: 6051 is set forth below. 

PSORT Prediction of Protein Localization Sites 

version 6. 4 (WWW) 

Species classification: 4 

10 

*** Reasoning Step: 1 

Preliminary Calculation of ALOM (threshold: 0.5) 
count : 0 

15 McG: Examining signal sequence (McGeoch) 

Length of OR: 3 
Peak. Value of UR: 0.19 
Net Charge of CR: 0 
Discriminant Score: -15.98 
20 GvH: Examining signal sequence (von Heijne) 

Signal Score (-3.5): -6.36 
Possible cleavage site: 58 
»> Seems to have no N- terminal signal seq. 
Amino Acid Composition of Predicted Mature Form: 
25 • calculated from 1 

ALOM new cnt: 0 ** thrshld changed to -2 
Cleavable signal was detected in ALOM?: 0B 
ALOM: finding transmembrane regions (Klein et al.) 
count: 0 value: 5.04 threshold: -2.0 
30 PERIPHERAL Likelihood =5.04 

modified ALOM score: -1.91 
. Gavel: Examining the boundary of mitochondrial targeting seq. 
motif at: 17 
PRITFG 

35 Discrimination of mitochondrial target seq. : 

negative (-3.97) 

*** Reasoning Step: 2 

K) KDEL Count: 0 

Checking apolar signal for intramitochondrial sorting 
Mitochondrial matrix? Score: 0.10 
SKL motif (signal for peroxisomal protein) : 
pos: -1(399), count: 0 
15 Amino Acid Composition Tendency for Peroxisome: 0.04 
Peroxisomal proteins? Status: notclr 

AAC score (peroxisome) : 0.072 
Amino Acid Composition tendency for lysosomal proteins 
score: 0.96 Status: notclr 
JO Modified score for lysosome: 0.246 

Checking the amount of Basic Residues (nucleus) 
Checking the 4 residue pattern for Nuclear Targeting 
Pound: pos: 256 (4) KKPR 
Pound: pos: 372 (5) KKKK 
!5 Checking the 7 residue pattern for Nuclear Targeting 

Checking the Robbins & Dingwall consensus (nucleus) 
Pound: pos: 372 (3) KK KKTDEAQPLP QRQKK 
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Found: pos: 373 (3) KK KTDEAQPLPQ RQKKQ 
Final Robbins Score (nucleus): 0.80 
Checking the KNA binding motif (nucleus or cytoplasm) 
nuc modified. Score: 0.90 
5 Nuclear Signal Status: positive ( 0.90) 

Checking CaaX motif. . 
Checking N-myristoylation. . 
Checking CaaX motif. . 

10 Final Results 

nucleus Cextainty= 0.980 (Affirmative) < suco 

lysosame (lumen) Certainty- 0.246 (Affirmative) < suco 

mitochondrial matrix space Certainty 0.100 (Affirmative) < suco 

^ microbody (peroxisome) Certainty^ 0.072 (Affirmative) < suco 

Predicted N-glycosylation sites of SEQ ID NO: 6051 are identified below. 

Position Potential Jury NGlyc 

agreement result 

48 NNTA 0.6879 (9/9) ++ (SEQ ID NO: 7270) 

20 270 NVTQ 0.7684 (9/9) +++ (SEQ ID NO: 7271) 

Residue No. Potential Threshold Assignment 
Thr 166 0.8547 0.6439 T 

Thr 367 0.5575 0.5403 T 

15 Thr 394 0.8217 0.5821 T 

Accordingly, the invention comprises a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 6051 wherein said fragment comprises one or more of the N- 
glycosylation sites identified above. The invention further comprises a polynucleotide encoding 
\0 one or more of the polypeptides identified above. 

The invention further comprises a polypeptide comprising a fragment of amino acid 
sequence SEQ ID NO: 6051 wherein said fragment does not include one or more of the 
N-glycosylation sites identified above. The invention includes a polynucleotide encoding such a 
fragment 

5 T-epitopes for SEQ ID NO: 6052 are identified in Table 25. The invention includes a 

polypeptide for use as an antigen, wherein the polypeptide comprises: (a) an amino acid 
sequence selected from the group consisting of the T-epitope sequences identified in SEQ ID 
NOS: 9539-9752; (b) an amino acid sequence having sequence identity to an amino acid 
sequence of (a). The invention further comprising a polynucleotide sequence encoding the 

0 polypeptides of (a) or (b). The invention further comprising a method of expression or delivery 
of such polynucleotides through viral vectors and/or viral particles. The invention further 
comprises a polypeptide comprising two or more of the T-epitope sequences identified in SEQ 
ID NOS: 9539-9752, or a polynucleotide encoding such a polypeptide. 

A variant of SEQ ID NO: 6052 that is included within the invention is SEQ ID NO: 9964. 

5 Compared to SEQ ID NO: 6052, this sequence has lie at residue 54 instead of Thr. 
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The use as an antigen is preferably a use: (1) as a T-cell antigen; (2) for generating a 
complex between a class I MHC protein (e.g. a class I HLA) and a fragment of said antigen; (3) 
as an antigen for raising a cell-mediated immune response; and/or (4) as an antigen for raising a 
CTL response. The use preferably protects or treats disease and/or infection caused by a SARS 
5 virus. Hie invention provides the use of a polypeptide in the manufacture of a medicament for 
immunising a mammal (typically a human) against SARS viral infection wherein the polypeptide 
is as defined above. 

The invention provides a method of raising an immune response in a mammal (typically a 
human), comprising the step of administering to the mammal a polypeptide as defined above, 

10 wherein said immune response is a cell-mediated immune response and, preferably, a CTL 
response. Hie immune response is preferably protective or therapeutic. 

The invention includes a composition comprising a SARS virus nucleocapsid protein or a 
fragment thereof and further comprising a SARS virus membrane protein or a fragment thereof. 
The composition may further comprising one or more adjuvants discussed below. 

15 The invention further includes a composition comprising a polypeptide comprising SEQ 

ID NO: 6051 or a fragment thereof or a sequence having sequence identity thereto and further 
comprising a polypeptide comprising SEQ ID NO: 6040, or a fragment thereof or a sequence 
having sequence identity thereto. Such composition may be used, for instance, in a vaccine. 
Such composition may further comprise one or more adjuvants discussed below. 

20 The invention includes a composition comprising a SARS virus nucleocapsid protein or a 

fragment thereof and a SARS virus spike protein or a fragment thereof. In one embodiment the 
nucleocapsid protein comprises a polypeptide sequence comprising SEQ ID NO: 6051 or a 
fragment thereof or a sequence having sequence identity thereto. In one embodiment, the spike 
protein comprises a polynucleotide comprising SEQ ID NO: 6042 or a fragment thereof or a 

25 sequence having sequence identity thereto. Hie composition may further comprise one or more 
of the adjuvants discussed below. 

The invention further includes a composition comprising antibodies specific to a SARS 
virus nucleocapsid protein and comprising antibodies specific to a SARS virus spike protein. In 
one embodiment the antibody is specific to a nucleocapsid protein comprises a polypeptide 

30 sequence comprising SEQ ID NO: 6051 or a fragment thereof or a sequence having sequence 
identity thereto. In one embodiment, the antibody is is specific to a spike protein comprises a 
polynucleotide comprising SEQ ID NO: 6042 or a fragment thereof or a sequence having 
sequence identity thereto. 

The invention further includes polynucleotide sequences, and fragments thereof, of a 

\5 SARS virus which are conserved among coronaviruses, and polypeptides encoded thereby. Such 
conserved sequences can be identified in the alignments shown in FIGURE 7. Such conserved 

-71- 



WO 2004/092360 PCMJS2004/011710 

sequences may be used in the vaccines of the invention or in the diagnostic reagents, kits and 
methods of the invention. 

Hie invention further includes polynucleotide sequences, and fragments thereof, of a 
SARS virus which are specific to SARS virus and not shared with coronaviruses. Such SARS 
5 specific sequences are also identified as SEQ ID NOS: 6040, 6043, 6044, 6047, 6048, 6049 and 
6050. Such SARS specific sequences may be used in the vaccines of the invention or in the 
diagnostic reagents, kits and methods of the invention. 

The invention also includes polynucleotide sequences which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
10 to diagnose or identify the presence or absence of a SARS virus in a biological sample. Hie 
invention includes a polynucleotide sequence comprising one or more of the primer sequences 
identified inSEQ ID NOS: 6076-6265 (Table 5). Hie invention further includes polynucleotide 
sequence comprising the complement of one or more of the primer sequences identified in SEQ 
ID NOS: 6076-6265. 

15 The invention also includes polynucleotide sequences which can be used as probes or 

primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 
invention includes a polynucleotide sequence comprising one or more of the primer sequences 
identified in SEQ ID NOS: 6266-6343 (Table 6). The invention further includes polynucleotide 

20 sequence comprising the complement of one or more of the primer sequences identified in SEQ . 
ID NOS: 6266-6343. 

The invention also includes polynucleotide sequences which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used - 
to diagnose or identify the presence or absence of a SARS virus in a biological sample. The 

15 invention includes a polynucleotide sequence comprising one or more of the primer sequences 
identified in SEQ ID NOS: 6344-6392 (Table 7). The invention further includes polynucleotide 
sequence comprising the complement of one or more of the primer sequences identified in SEQ 
ID NOS: 6344-6392.. 

The invention also includes polynucleotide sequences which can be used as probes or 

!0 primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify the presence or absence of a SARS virus in a biological sample. Hie 
invention includes a polynucleotide sequence comprising one or more of the primer sequences 
identified in SEQ ID NOS: 6393-6559 (Tables 8 & 9). The invention further includes 
polynucleotide sequence comprising the complement of one or more of the primer sequences 

;5 identified in SEQ ID NOS: 6393-6559. 
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The invention also includes polynucleotide sequences which can be used as probes or 
primers for diagnostic reagents, kits (comprising such reagents) and methods which can be used 
to diagnose or identify the presence or absence of a SARS virus in a biological sample. Hie 
invention includes a polynucleotide sequence comprising one or more of the primer and probe 
5 sequences identified in SEQ ID NOS: 6560-6568. The invention farther includes polynucleotide 
sequence comprising the complement of one or more of the primer sequences identified in SEQ 
ID NOS: 6560-6568. 

Hie invention includes a polypeptide sequence comprising any one of even-numbered SEQ 
ID NOS: 7272-7290, or a fragment thereof, or a sequence having sequence identity thereto. Hie 
10 invention further includes a polynucleotide sequence encoding any one of even-numbered SEQ 
ID NOS: 7272-7290, or a fragment thereof, or a sequence having sequence identity thereto. 
Examples of such polynucleotide sequences are odd-numbered SEQ ID NOS: 7273-7291. 

The invention includes a polynucleotide sequence comprising an intergenic sequence 
which is common to each open reading frame of the SARS virus. The SARS virus is thought to 
15 use this sequence to signal translation of the open reading frame. The intergenic sequence 

comprises a lQmer SEQ ID NO: 7292, or optionally a hexamer SEQ ID NO: 7293. When the 
virus transcribes its positive (+) RNA strand to (-) RNA strand, the virus replicating structure 
uses the (-) strand template to transcribe nucleotides at the 5 f end prior to the first intergenic 
sequence, followed by the intergenic sequence, followed by the selected open reading frame. 
10 The virus then creates multiple mRNAs comprising the 5 1 end, the intergenic sequence and 
coding sequence. For more details on Nidovriales replication (including Coronavims) see e.g., 
Ziebuhr et al. y "Virus-encoded proteinases and proteolytic processing in the Nidovirales", 
Journal of General Virology 81 :853-879 (2000), incorporated herein by reference in its entirety. 
Hie invention comprising a polynucleotide sequence comprising SEQ ID NO: 7292 or the 
15 complement thereof. The invention comprising a polynucleotide sequence comprising SEQ ID 
NO: 7293 or the complement thereof. Hie invention further comprises a polynucleotide 
sequence comprising nucleotides from the 5' end of the SARS viral genome, or its reverse 
complement, and further comprising an intergenic sequence or its reverse complement. The 
polynucleotide may further comprise one or more of the SARS virus open reading frames. 
0 Examples of polynucleotide sequences comprising nucleotides from the 5' end of the SARS virus 
genome followed by the intergenic sequence are SEQ ID NOS: 7294-7301.- 

The invention includes a polynucleotide sequence comprising a sequence selected from the 
group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 
7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID 
5 NO: 7300 and SEQ ID NO: 7301, or a fragment thereof, or a sequence having sequence identity 
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thereto. In one embodiment, the polynucleotide does not consist entirely of a known SARS virus 
sequence. 

The SARS virus intergenic sequence can be used to create a RNAi molecule. Such a 
SARS virus specific RNAi molecule can be used to treat SARS virus infection. Hie invention 
5 includes a RNAi molecule comprising a double stranded RNA molecule wherein one RNA 
strand comprises a sequence selected from the group consisting of SEQ ID NO: 7292, SEQ ID 
NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ 
ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301, or a fragment 
thereof. Preferably, said RNA strand comprises a sequence selected from the group consisting of 
10 SEQ ID NO: 7292 and SEQ ID NO: 7293. Preferably, the other RNA strand comprises the 

reverse complement of the first strand or a polynucleotide sequence which hybridizes to the first 
strand. 

The invention includes the use of RNAi in a method of treatment for SARS virus infection 
comprising administering to a mammal an effective amount of the si RNA molecule. Preferably, 

15. the RNAi molecule comprises the molecule described above. Further discussion of the RNAi 
applications of the intergenic sequence is included in section IV of the specification below. 

The invention also includes the use of a SARS virus antisense nucleotide sequence, 
preferably antisense directed to the SARS virus intergenic sequence. Such an antisense sequence 
may be used in the treatment of a subject infected with the SARS virus. The antisense of the 

20 SARS virus intergenic sequence can be designed to bind to the SARS viral polynucleotides to 
block access of the viral replication machinery to the intergenic sequence. Such an antisense 
sequence may also be used to identify the presence or absence of a SARS virus in a biological 
sample. The antisence can itself be labeled or the antisense associated with viral polynucleotides 
can be detected by means known in the art 

15 Antisense nucleic acids are designed to specifically bind to RNA, resulting in the 

formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse 
transcription or messenger RNA translation. Antisense polynucleotides based on a selected 
sequence can interfere with expression of the corresponding gene. Antisense polynucleotides 
will bind and/or interfere with the translation of the corresponding mRNA. 

JO The invention also includes the use of the intergenic region with a ribozyme. 

Trans-cleaving catalytic RNAs (ribozymes) are RNA molecules possessing 
endoribonuclease activity. Ribozymes are specifically designed for a particular target, and the 
target message must contain a specific nucleotide sequence. They are engineered to cleave any 
RNA species site-specifically in the background of cellular RNA. The cleavage event renders the 

\5 mRNA unstable and prevents protein expression. Importantly, ribozymes can be used to inhibit 
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expression of a gene of unknown function for the purpose of determining its function in an in 
vitro or in vivo context, by detecting the phenotypic effect 

One commonly used ribozyme motif is the hammerhead, for which the substrate sequence 
requirements are minimal. Design of the hammerhead ribozyme is disclosed in Usman et aL 9 

5 Current Opin. Struct. Biol (1996) 6:527-533. Usman also discusses the therapeutic uses of 
ribozymes. Ribozymes can also be prepared and used as described in Long et oi, FASEB J. 
(1993) 7:25; Symons, Arm. Rev. Biochem. (1992) 61:641; Perrotta et at, Biochem. (1992) 31:16- 
17; Ojwang et aL % Proc. Natl Acad. ScL (USA) (1992) 89:10802-10806; and US Patent 
5,254,678. Ribozyme cleavage of HIV-I RNA is described in US Patent 5,144,019; methods of 

10 cleaving RNA using ribozymes is described in US Patent 5,1 16,742; and methods for increasing 
the specificity of ribozymes are described in US Patent 5,225,337 and Koizumi et aL, Nucleic 
Acid Res. (1989) 17:7059-7071. Preparation and use of ribozyme fragments in a hammerhead 
structure are also described by Koizumi et al , Nucleic Acids Res. (1989)17:7059-7071.: 
Preparation and use of ribozyme fragments in a hairpin structure are described by Chowrira & 

15 Burke, Nucleic Acids Res. (1992) 20:2835. Ribozymes can also be made by rolling transcription 
as described in Daubendiek & Kool, Nat. Biotechnol. (1997) 15(3):273-277. 

The hybridizing region of the ribozyme may be modified or may be prepared as a branched 
structure as described in Horn & Urdea, Nucleic Acids Res. (1989) 17:6959-67. The basic 
structure of the ribozymes may also be chemically altered in ways familiar to those skilled in the 

20 art, and chemically synthesized ribozymes can be administered as synthetic oligonucleotide 

derivatives modified by monomelic units. In a therapeutic context, liposome mediated delivery • 
of ribozymes. improves cellular uptake, as described in Birikh et al , Eur. J. Biochem. (1997) 
245:1-16. 

Therapeutic and functional genomic applications of ribozymes proceed beginning with 
25 knowledge of a portion of the coding sequence of the gene to be inhibited. In the present 

invention, the target sequence preferably comprises the intergeneic sequence of the SARS virus. 
Preferably, the sequence is selected from the group consisting of SEQ ID NO: 7292 and SEQ ID 
NO: 7293. A target cleavage site is selected in the target sequence, and a ribozyme is 
constructed based on the 5 1 and 3* nucleotide sequences that flank the cleavage site. Preferably, 
30 the 5' nucleotide sequence includes the 5' untranslated region of the SARS virus. The ribozyme 
may then further be constructed from one or more of the polynucleotide sequences selected from 
the group consisting of SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 
7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301. 
Antisense treatment of HTV infection is described in the following references, each of 
35 which is incorporated herein by reference in their entirety, (antisense RNA complementary to 
the mRNA of gag, tat, rev, env) (Sezakiel et aL, 1991, J. Virol. 65:468-472; Chatterjee et aL 9 
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1992, Science 258:1485-1488; Rhodes etal, 1990, J. Gen. Virol. 71:1965. Rhodes etaL, 1991, 
AIDS 5:145-151; Sezakiel etaL, 1992, J. Virol. 66:5576-5581; Joshi etaL, 1991, J. Virol. 
65:5524-5530). 

The invention includes the use of decoy RNA to disrupt the S ARS virus replication and 

5 life cycle. Methods of making and using such decoy RNA for treatment of a viral infection are 
known in the art The invention includes delivery of genes encoding, for example, the S ARS 
virus intergenic sequence, to infected cells. Preferably, the sequence comprises one or more of 
the sequences selected from the group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ 
ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, 

10 SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301. Preferably, the sequence 

comprises one or more of the sequences selected from the group consisting of SEQ ID NO: 7292 
and SEQ ID NO: 7293. Preferably, the sequence comprises SEQ ID NO: 7293. 

In the present invention, delivery of intergenic sequence which is not linked to the SARS 
virus open reading frames disrupts the translation process of the viral RNA and decreases the 

15 production of vial proteins. Similar methods of treatment for HIV viral infection have been 
described The following references discuss the use of decoy RNA of HIV TAR orRRE for 
treatment of HIV infection. Each of these references is incorporated herein by reference in their 
entirety. (SuUenger et a/., 1990, Cell 63:601-608; SuUenger et a/., 1991, J. Virol. 65:6811-6816; 
lisziewicz et aL, 1993, New Biol. 3:82-89; Lee et a/., 1994, J. Virol. 68:8254-8264), ribozymes ♦ 

20 (Sarver et a£, 1990, Science 247:1222-1225; Wecrasinghe et al. y 1991, J. Virol. 65:5531-5534; J 
Dropulic et al. y 1992, J. Virol. 66:1432-1441; Ojwang et aL, 1992, Proc. Nad. Acad. Sci. USA. 
89:10802-10806; Yu etal., 1993, Proc. Natl. Acad. Sci. USA. 90:6340-6344; Yu ^ ai, 1995, 
Proc. Natl. Acad. Sci. USA. 92:699-703; Yamada et al. y 1994, Gene Therapy 1:38-45). 
The invention includes the use of the SARS virus intergenic sequence in diagnostic 

25 reagents, kits (comprising such reagents) and methods which can be used to diagnose or identify 
the presence or absence of a SARS virus in a biological sample. Such diagnostic reagents, kits, 
and methods are further discussed in Section II of the specification. 

The invention includes a pair of primers for amplifying a SARS polynucleotide sequence 
comprising (i) a first primer comprising a sequence which is substantially identical to a portion 

30 of a sequence selected from the group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ 
ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, 
SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301 and (ii) a second primer 
comprising a sequence which is substantially complementary to a portion of a sequence selected 
from the group consisting of the sequence SEQ ID NO: land the sequence SEQ ID NO: 2, such 

35 that the primer pair (i) and (ii) defines a template sequence within a sequence from the group 
consisting of the sequence SEQ ID NO: land the sequence SEQ ID NO: 2. Preferably, the (i) 



WO 2004/092360 



PCT/US2004/011710 



first prima- comprises a sequence which is substantially identical to a portion of a sequence 
selected from the group consisting of SEQ ID NO: 7292 and SEQ ID NO: 7293. Preferably, the 
(i) first primer comprises a sequence which is substantially identical to a portion of the sequence 
of SEQ ID NO: 7293. The amplicon defined by said first and second primers is preferably 

5 between 50 and 250 nucleotides in length. Hie primers may optionally be labeled to facilitate 
their detection. Methods and compositions for use in labeling primers are discussed further in 
the application in Section DL 

Hie invention further includes a pair of primers for amplifying a SARS polynucleotide 
sequence comprising (i) a first primer comprising a sequence which is substantially identical to a 

10 portion of the complement of a portion of a sequence selected from the group consisting of SEQ 
ID NO: 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, 
SEQ ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 
7301 and (ii) a second primer comprising a sequence which is substantially complementary to a ■ 
portion of the complement of a sequence selected from the group consisting of the sequence SEQ 

15 ID NO: land the sequence SEQ ID NO: 2, such that the primer pair defines a template sequence 
within a sequence selected from the group consisting of the sequence SEQ ID NO: 1 and the 
sequence SEQ ID NO: 2. The amplicon defined by said first and second primers is preferably 
between 50 and 250 nucleotides in length. The primers may optionally be labeled to facilitate 
their detection.. Methods and compositions for use in labeling primers are discussed further in 

20 the application in Section TEL j 
The invention includes a kit comprising (i) a first primer comprising a sequence which is 
substantially identical to a portion of a sequence selected from the group consisting of SEQ ID 
NO: 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 7295, SEQ ID NO? 7296, SEQ 
ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301 

25 and (ii) a second primer comprising a sequence which is substantially complementary to a 

portion of a sequence selected from the group consisting of the sequence SEQ ID NO: 1 and the 
sequence SEQ ID NO: 2, such that the primer pair (i) and (ii) defines a template sequence within 
a sequence from the group consisting of SEQ ID NO: 1 and SEQ ID NO: 2. Preferably, the (i) 
first primer comprises a sequence which is substantially identical to a portion of a sequence 

30 selected from the group consisting of SEQ ID NO: 7292 and SEQ ID NO: 7293. Preferably, the 
(i) first primer comprises a sequence which is substantially identical to a portion of the sequence 
of SEQ ID NO: 7293. The primers may optionally be labeled to facilitate their detection. 
Methods and compositions for use in labeling primers are discussed further in the application in 
Section m. 

35 Other preferred kits comprise (i) a first primer comprising a sequence which is 

substantially identical to a portion of the complement of a portion of a sequence selected from 
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the group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ ID NO: 7294, SEQ ID NO: 
7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, SEQ ID NO: 7299, SEQ ID 
NO: 7300 and SEQ ID NO: 7301 and (ii) a second prima: comprising a sequence which is 
substantially complementary to a portion of the complement of a sequence selected from the 
5 group consisting of SEQ ID NO: 1 and SEQ ID NO: 2, such that the primer pair defines a 
template sequence within a sequence selected from the group consisting of SEQ ID NO: 1 and 
SEQ ID NO: 2. 

Hie invention further includes an attenuated S ARS virus for use as a vaccine wherein the 
intergenic region has been mutated to reduce expression of the viral structural or nonstructural 

10 proteins. The attenuated SARS virus may comprises one or more additions, deletions or 

insertion in one or more of the intergenic regions of the viral genome. Preferably, the attenuated 
SARS virus comprises an addition, deletion or insertion in one or more occurrences of the 
sequence selected from the group consisting of SEQ ID NO: 7292 and SEQ ID NO: 7293. 
Preferably, the addition, deletion or insertion occurs in one or more occurrences of SEQ ID NO: 

15 7293. 

The invention further comprises a small molecule which inhibits binding or association of 
the SARS viral replication machinery, such as a ribonucleoprotein, with the intergenic region of 
the viral genome. Preferably, the small molecule inhibits binding or association of the SARS 
viral machinery with a sequence selected from the group consisting of SEQ ID NO: 7292 and 

20 SEQ ID NO: 7293. Preferably, the small molecule inhbiits binding or association of the SARS 
viral machinery with SEQ ID NO: 7293. The invention further includes a method of screening 
for a small molecule for treatment of SARS viral infection comprising using an assay to identify 
a small molecule which interferes with the association of the SARS viral replication machinery 
with the intergenic region of the SARS viral genome. 

25 The invention further provides a novel SARS polynucleotide sequence SEQ ID NO: 9968. 

All six reading frames of this 690mer sequence are shown in Figure 1 13. The constituent amino 
acid sequences from Figure 1 13, having at least 4 amino acids, are listed as SEQ ID NOS: 9969 
to 10032. 

Accordingly the invention includes a polynucleotide sequence comprising SEQ ID NO: 
30 9968. It also provides polynucleotide sequences having sequence identity to SEQ ID NO: 9968. 
The degree of sequence identity is preferably greater than 50% (e.g., 60%, 70%, 80%, 85%, 
88%, 90%, 92%, 95%, 99% or more). 

The invention includes an amino acid sequence encoded by the polynucleotide sequence of 
SEQ ID NO: 9968, including the amino acid sequences selected from the group consisting of 
35 SEQ ID NO s : 9969 to 10032. Preferably, the amino acid sequence comprises SEQ ID NO: 9997 
or comprises SEQ ID NO: 9998. 
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The invention also provides amino acid sequences having sequence identity to an amino 
acid sequence encoded by SEQ ID NO: 9968. The invention provides amino acids having 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO s : 
9969 to 10032. The degree of sequence identity is preferably greater than 50% (e.g., 60%, 70%, 
5 80%, 85%, 88%, 90%, 92%, 95%, 99% or more). 

A portion of SEQ ID NO: 9968 matches with approximately 98% identity to a previously 
published SARS polynucleotide sequence, commonly referred to as "BNI-1" (SEQ ID NO: 
10033). BNI-1 was sequenced at Bernhard Nocht Institute for Tropical Medicine, National 
Reference Cento: for Tropical Infectious Diseases in Hamburg, Germany. Hie BNI-1 sequence 

10 was published on the WHO website on April 4, 2003 at http://www.who. int/csr/sars/primers/en and in 
Dorsten et aL 9 'Identification of a Novel Coronavirus in Patients with Severe Acute Respiratory 
Syndrome", New England Journal ofMedicine* published online at http://www.nejm.org on April 
10,2003. Both references are incorporated herein by reference in their entirety. Hie six reading 
frames of this 302mer sequence are shown in Figure 1 14 (see also Figure 129). The constituent 

15 amino acid sequences from Figure 1 14, having at least 4 amino acids, are listed as SEQ ID NO s : 
10034 to 10065. An alignment of SEQ ID NO: 10034 with SEQ ID NO: 9997 is shown in Figure 
130. 

: The invention provides for polynucleotide sequences comprising fragments of SEQ ID 
NO: 9968. In one embodiment, the fragment does not consist entirely of SEQ ID NO: 10033 or 
20 of a known coronavirus. 

. : The invention provides for amino acid sequences comprising fragments of an amino acid 
sequence encoded by SEQ ID NO: 9968. In one embodiment, the fragment does not consist 
entirely of an amino acid sequence encoded by SEQ ID NO: 10033 or a known coronavirus. 
Hie invention provides for amino acids comprising fragments of an amino acid sequence 
25 selected from the group consisting of SEQ ID NO s : 9969 to 10032. In one embodiment, the 
fragment does not consist entirely of an amino acid sequence encoded by the polynucleotide 
sequence of SEQ ID NO: 10033 or a known coronavirus. 

Approximately 100 nucleotides at the 5' end of SEQ ID NO: 9968 do not match any 
portion of the BNI-1 polynucleotide sequence (SEQ ID NO: 10033). This unmatched portion is 
JO set forth as SEQ ID NO: 10066. The invention thus further provides a polynucleotide comprising 
the sequence comprising SEQ ID NO: 10066, polynucleotide sequences having sequence 
identity to SEQ ID NO: 10066, or polynucleotide sequences comprising fragments of SEQ ID 
NO: 10066. 

Hie invention further comprises an amino acid sequence encoded by SEQ ID NO: 10066, 
J5 an amino acid sequence having sequence identity to an amino acid sequence encoded by SEQ ID 
NO: 10066, or an amino acid sequence comprising fragments of an amino acid sequence 
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encoded by SEQ ID NO: 10066. Preferably, the amino arid sequence comprises SEQ ID NO: 
10067. 

SEQ ID NO: 9997/9998 demonstrates homology with the a region of pollab of several 
coronaviruses. FIGURE 115 shows an alignment of SEQ ID NO s : 9997/9998 to amino acid 
5 sequences for pollab of bovine coronaviius (SEQ ID NO: 10068), avian infectious bronchitis 
1 virus (SEQ ID NO: 10069) and murine hepatitis virus (SEQ ID NO: 10070). A consensus amino 
arid sequence of SEQ ID NO s : 9997/9998, SEQ ID NO: 10068, SEQ ID NO: 10069, and SEQ 
ID NO: 10070 is shown in the bottom row of the alignment in Figure 115 (e.g. SEQ ID NO: 
10071). 

10 As shown in FIGURE 113, the polynucleotide sequence encoding SEQ ID NO: 9997 has a 

stop codon after codon 205, between SEQ ID NO s : 9997 and 9998. Optionally, the stop codon 
can be removed and the amino arid sequence continued (SEQ ID NO: 10072). Accordingly, the 
invention provides for an amino acid sequence comprising SEQ ID NO: 9997 and/or SEQ ID 
NO: 9998, or SEQ ID NO: 10072, and further comprising an amino arid sequence encoding for 

15 the C-tenninus of a coronavirus pollab gene or a fragment thereof. 

As shown in FIGURE 115, SEQ ID NO s : 10068, 10069, 10070 and 10071 contain amino 
acids prior to the N-terminus of SEQ ID NO: 9997. The invention also provides for an amino 
arid sequence comprising SEQ ID NO: 9997 and further comprising an amino acid sequence 
encoding for the N-terminus of a coronavirus pol lab protein or a fragment thereof. 

20 The pol lab sequences on FIGURE 1 15 contain a coding region indicated on the 

schematic of FIGURE 1 17 by a "*". In FIGURE 1 15, the beginning of this genomic region is 
designated by flie arrow crossing in front of amino arid 6080 of the consensus sequence SEQ ID 
NO: 10071. Hie end of this genomic region is designated by the arrow crossing in front of 
amino arid 6604 of the consensus sequence. Hie invention provides for an amino arid sequence 

25 comprising SEQ ID NO: 9997 and/or SEQ ID NO: 9998, or SEQ ID NO: 10072, and further 

comprising a first amino sequence prior to the N-terminus of said SEQ ID NO: 9997 and/or SEQ 1 
ID NO: 9998, or SEQ ID NO: 10072, wherein said first amino arid sequence has homology to an - 
N-terminus sequence of a known coronavirus pollab "*" protein or a fragment thereof. 

The invention further provides for an amino arid sequence comprising SEQ ID NO: 9997 

30 and SEQ ID NO: 9998, wherein the stop codon after SEQ ID NO: 9971 is removed (Le. SEQ ID 
NO: 10072), and further comprising a second amino arid sequence following the C terminus of 
SEQ ID NO: 9998, wherein said second amino arid sequence is homologous with a C terminus 
of a known coronavirus pollab protein or a fragment thereof. 

Examples of such proteins are shown aligned in FIGURE 118, and are SEQ ID NO s : 

35 10073 to 10077. SEQ ID NO: 10073 comprises SEQ ID NO: 9997 and further comprises amino 
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acids prior to the N-terminus and subsequent to the C-terminus from the pollab protein of 
avian infectious bronchitis virus. SEQ ID NO: 10074 comprises SEQ ID NO: 9997 and further 
comprises amino acids prior to the N-tenninus and subsequent to the C-terminus from the 
pollab protein of bovine coronavirus. SEQ ID NO: 10075 comprises SEQ ID NO: 9997 
5 and further comprises amino acids prior to the N-tenninus and subsequent to the C-terminus 
from the pollab protein of murine hepatitis virus. SEQ ID NO: 10076 comprises SEQ ID 
NO: 9997 and further comprises amino acids prior to the N-tenninus and subsequent to the 
C-terminus from the consensus of the pollab protein of avian infectious bronchitis virus, 
bovine coronavirus, and murine hepatitis virus (FIGURE 115). SEQ ID NO: 10077 comprises 

10 the consensus sequence of SEQ ID NOS: 10073 to 10076. 

The invention comprises an amino acid sequence selected from the group consisting of 
SEQ ID NO s : 10073, 10074, 10075, 10076 and 10077. The invention further includes an amino 
acid sequence comprising fragments of an amino acid sequence selected from the group 
consisting of SEQ ID NO s : 10073, -10074, 10075, 10076 and 10077. The invention further 

15 comprises an amino acid sequence with sequence identity to a sequence selected from the group 
consisting of SEQ ID NO s : 10073, 10074, 10075, 10076 and, 10077. 

The invention comprises polynucleotides encoding for the amino acid sequences selected 
from the group consisting of SEQ ID NO s : 10073, 10074, 10075, 10076 and 10077. The 
invention comprises polynucleotides having sequence identity to polynucleotides encoding for 

20 the amino acid sequences selected from the group consisting of SEQ ID NO s : 10073, 10074, 

10075, 10076 and 10077. The invention comprises fragments of polynucleotides encoding SEQ * 
ID NO 55 : 10073, 10074, 10075, 10076 and 10077. 

As shown in Figure 113, SEQ ID NO: 9968 includes a sequence that encodes SEQ ID NO: 
10020 followed by a stop codon, giving a C-terminus threonine (Thr) residue. The corresponding 

25 sequence from an amino acid sequence encoded by BNI-1 is SEQ ID NO: 10078, which 
continues past the C-terminus of SEQ ID NO: 10020. Accordingly, the invention includes a 
protein comprising amino acid sequence SEQ ID NO: 10020 or an amino acid sequence having 
sequence identity to SEQ ID NO: 10020 or an amino acid sequence comprising a fragment of 
SEQ ID NO: 10020, wherein the C-terminus residue of said protein is a threonine. Preferably, 

30 the C-terminus of said protein is -ST. Still more preferably, the C-terminus of said protein is - 
EST. The invention also includes a protein comprising amino acid sequence SEQ ID NO: 10078 
or an amino acid sequence having sequence identity to SEQ ID NO: 10078 or an amino acid 
sequence comprising a fragment of SEQ ID NO: 10078, wherein the C-terminus residue of said 
protein is Thr. Preferably, the C-terminus of said protein is -ST. Still more preferably, the C- 

55 terminus of said protein is -EST. 
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SEQ ID NO: 9968 also encodes a 54mer amino acid sequence SEQ ID NO: 10015. Hie 
polynucleotide encoding SEQ ID NO: 10015 encodes two stop codons at its C-terminus (Figure 
113). The corresponding region from the BNI-1 sequence does not contain this 54mer. 
Accordingly, the invention includes a protein comprising amino arid sequence SEQ ID NO: 
5 10015, or an amino acid sequence having sequence identity to SEQ ID NO: 10015 or an amino 
acid sequence comprising a fragment of SEQ ID NO: 10015. Hie invention further includes a 
polypeptide comprising SEQ ID NO: 10015 and further comprising a first amino acid sequence 
prior to the N-terminus of SEQ ID NO: 10015. 

SEQ ID NO: 9968 encodes the amino arid sequence SEQ ID NO: 9969. Hie 

10 polynucleotide sequence contains a stop codon at the C-terminus of SEQ ID NO: 9969. 

Accordingly, the invention includes a protein comprising amino arid sequence SEQ ID NO: 
9969, or an amino arid sequence having sequence identity to SEQ ID NO: 9969. The invention 
further includes a polypeptide comprising SEQ ID NO: 9969 and further comprising a first 
amino arid sequence prior to the N-tenninus of SEQ ID NO: 9969. The invention further 

15 includes a polypeptide comprising the sequence SEQ ID NO: 10079. 

SEQ ID NO: 9968 encodes amino arid sequence QRT (Figure 113), followed by a stop 
codqn. Accordingly, the invention includes a protein comprising amino acid sequence QRT. The 
invention further includes a polypeptide comprising amino acid sequence QRT and further 
comprising a first amino arid sequence prior to the N-terminus of the sequence QRT. 

20 SEQ ID NO: 9968 encodes amino arid sequence SEQ ID NO: 10022, followed by a stop 

codon at its C-terminus. Accordingly, the invention includes a protein comprising amino arid 
sequence SEQ ID NO: 10022, or an amino arid sequence having sequence identity to SEQ ID 
NO: 10022. The invention further includes a polypeptide comprising SEQ ID NO: 10022 and 
further comprising a first amino arid sequence prior to the N-terminus of SEQ ID NO: 10022. 

25 SEQ ID NO: 9968 encodes amino acid sequence SEQ ID NO: 10027. Within the SEQ ID 

NO: 10027 coding sequence there are at least three start codons, identified with underlining in 
Rgure 119. The open reading frame indicated by the first start codon is SEQ ID NO: 10081. The 
open reading frame indicated by the second start codon is SEQ ID NO: 10082. The open reading 
frame indicated by the third start codon is SEQ ID NO: 10083. 

30 The invention provides a novel SARS polynucleotide sequence SEQ ID NO: 10084. All 

six reading frames of this 1463mer sequence are shown in Figure 120 (see also Figure 122). Hie 
constituent amino acid sequences from Figure 120, having at least 4 amino acids, are listed as 
SEQ ID NOS: 10085 to 10209 (see Figures 120A to 120F). 

The invention includes a polynucleotide sequence comprising SEQ ID NO: 10084. The 

35 invention also provides polynucleotide sequences having sequence identity to SEQ ID NO: 

10084. The invention also provides for polynucleotide sequences comprising fragments of SEQ 

-82- 



WO 2004/092360 



PCT/US2004/011710 



ID NO: 10084. In one embodiment, the polynucleotide fragment does not consist entirely of 

SEQ ID NO: 10033 or a known coronavirus polynucleotide sequence or a known SARS 

polynucleotide sequence. 

Hie invention includes an amino acid sequence encoded by the polynucleotide sequence of 
5 SEQ ID NO: 10084, including the amino acid sequences of Figures 120A to 120F e.g. selected 

from the group consisting of SEQ ID NO s : 10085 to 10209. Preferably, the amino acid sequence 

comprises SEQ ID NO: 10149. 

The invention also provides amino acid sequences having sequence identity to an amino 

acid sequence encoded by SEQ ID NO: 10084. The invention provides amino acids having 
10 sequence identity to an amino acid sequence from Figures 120A to 120F e.g. selected from the 

group consisting of SEQ ID NO s : 10085 to 10209 ' 

The invention also provides fragments of amino acid sequences encoded by SEQ ED NO: 

10084. The invention also provides fragments of amino acid sequences selected from the group 

consisting of SEQ ID NO s : 10085 to 10209. In one embodiment, the fragment does not consist 
15 entirely of an amino acid sequence encoded by SEQ ID NO: 10033 or an amino acid sequence of 

a known coronavirus or an amino acid sequence of a known SARS virus. An alignment of the 

matching portion of SEQ ID NO: 10033 and SEQ ID NO: 10084 is included in FIGURE 121. 
In one embodiment, the invention comprises an amino acid sequence comprising SEQ ID 

NO: 10149. An alignment of the polynucleotide sequence SEQ ID NO: 10084 to the encoded 
20 SEQ ID NO: 10149 is shown in FIGURE 122 (5'3' Frame 3). Analysis of the 53' Frame 3 

translation bya computer program to predict start codon methionines (NetStart 1.0) (FIGURE 

123) reveals SEQ ID NO s : 10210 to 10215. 

The invention includes a protein comprising an amino acid sequence selected from the 

group consisting of SEQ ID NO: 10210, SEQ ID NO: 10211, SEQ ID NO: 10212, SEQ ID NO: 
25 10213, SEQ ID NO: 10214 and SEQ ID NO: 10215. Hie invention includes a protein having 

sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 

10210, SEQ ID NO: 10211, SEQ ID NO: 10212, SEQ ID NO: 10213, SEQ ID NO: 10214 and 

SEQ ID NO: 10215. In one embodiment, the protein does not consist entirely of an amino acid 

sequence of a known SARS virus or of a known coronavirus. 
30 The invention includes a fragment of a protein comprising an amino acid sequence selected 

from the group consisting of SEQ ID NO: 10210, SEQ ID NO: 10211, SEQ ID NO: 10212, SEQ 

ID NO: 10213, SEQ ID NO: 10214 and SEQ ID NO: 10215. In one embodiment, the fragment 

does not consist entirely of an amino acid sequence of a known SARS virus or of a known 

coronavirus. 

35 In one embodiment, the invention includes a polypeptide comprising the amino acid 

sequence selected from the group consisting of SEQ ID NO: 10210, SEQ ID NO: 10211 and 
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SEQ ID NO: 10212. Partial results of a BLAST of SEQ ID NO: 10210 against GenBank is 
included in FIGURE 124. These results indicate that SEQ ID NOS: 10210, 10211 and 10212 
have functional similarities to a Coionaviius RNA polymerase, particularly the RNA polymerase 
of murine hepatitis virus, bovine coionavirus, and avian infectious bronchitis. 

5 In one embodiment, the invention is directed to a polypeptide comprising a first amino acid 

sequence selected from the group consisting of SEQ ID NO: 10210, SEQ ID NO: 1021 1 and 
SEQ ID NO: 10212 and a second amino acid sequence from the C-terminus of a coronavirus 
ORFlab sequence. Preferably, the second amino acid sequence is from a bovine coronavirus. 
One example of this embodiment is shown below as SEQ ID NO: 10216. Amino acids 1-481 of 

10 SEQ ID NO: 10216 are the first amino acid sequence of SEQ ID NO: 10210, and amino acids 
482-1152 are the second amino acid sequence of the C-terminus of a bovine coronavirus 
orilab polyprotein (Gi 26008080) (NP_150073.2) (SEQ ID NO: 10217). 

Accordingly, the invention includes a polypeptide comprising SEQ ID NO: 10216. The 
invention further includes a polypeptide comprising a first amino acid sequence of SEQ ID NO: 

15 10210 and a second amino acid sequence of SEQ ID NO: 10217. The invention further includes- 
a polypeptide comprising a first amino acid sequence having greater than x% identity to SEQ ID 
NO: 10210 and a second amino acid sequence having greater than y% identity to SEQ ID NO: 
10217, wherein x is greater than or equal to 85% (e.g., 86%, 87%, 88%, 89%, 90%, 91%, 92%, ' 
93%, 94%, 95%, 96%, 97%, 98%, 99% or more) and wherein y is greater than or equal to 60% 

20 (e.fc., 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more). 

The invention also includes a polypeptide comprising a fragment of SEQ ID NO: 10210, - 
wherein said fragment includes an epitope. Computer-predicted epitopes of SEQ ID NO: 10210, 
using a 17mer window, are included in FIGURE 125 A (Hopp & Woods) and FIGURE 125B 
(Kyte&Doolittle). 

25 The amino acid sequence of SEQ ID NO: 10210 also contains two predicted glycosylation 

sites at amino acids 81-84 (NNTE; SEQ ID NO: 10218) and at 180-183 (NHSV; SEQ ID NO: 
10219). Accordingly, the invention includes a polypeptide comprising a fragment of SEQ ID 
NO: 10210, wherein said fragment includes a glycosylation site. The invention further includes 
a polypeptide comprising a fragment of SEQ ID NO: 10210, wherein said fragment includes the 

30 Asn at position 81. Preferably, said Asn is glycosylated. The invention further includes a 
polypeptide comprising a fragment of SEQ ID NO: 10210, wherein said fragment includes the 
Asn at position 180. Preferably, said Asn is glycosylated. 

In one embodiment, the invention includes a polypeptide comprising an amino acid 
sequence from within Figure 120D and/or SEQ ID NO s : 10150 to 10160 e.g. from SEQ ID NO s : 

35 10154, 10155, 10158 and 10160. Within SEQ ID NO: 10154 the following amino acid sequences 

starting with a Met and ending at a stop codon can be identified: SEQ ID NO s : 10220 to 10227. 
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Accordingly, the invention includes a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 10220, SEQ ID NO: 10221, SEQ ID NO: 
10222, SEQ ID NO: 10223, SEQ ID NO: 10224, SEQ ID NO: 10225, SEQ ID NO: 10226 and 
SEQ ID NO: 10227, or a fragment thereof or an amino arid sequence having sequence identity 
5 thereto. 

In one embodiment, the invention includes a polypeptide comprising the amino acid 
sequence within Figure 120E e.g. from SEQ ID NO s : 10161 to 10182, and in particular SEQ ID 
NOS: 10171 and 10176. Within SEQ ID NO s : 10171 and 10176 the following amino arid 
sequences starting with a Met and ending at a stop codon can be identified: SEQ ID NO: 10228 

10 and SEQ ID NO: 10229. 

Accordingly, the invention includes a polypeptide comprising an amino arid sequence 
selected from the group consisting of SEQ ID NO: 10228 and SEQ ID NO: 10229, or a fragment 
thereof or an amino arid sequence having sequence identity thereto. 

In one embodiment, the invention includes a polypeptide comprising an amino acid 

15 sequence from Figure 120F e.g. SEQ ID NO s : 10183 to 10209. Within Figure 120F the 

following amino acid sequence starting with a Met and ending at a stop codon can be identified: 
SEQ ID NO: 10187. Accordingly, the invention includes a polypeptide comprising an amino 
arid sequence of SEQ ID NO: 10187, or a fragment thereof or an amino acid sequence having 
sequence identity thereto. 

20 In one embodiment, the polynucleotides of the invention do not include one of the 

following primers, disclosed at http://contert.nejni.org/cgi/reprira/NE^ 

5 » GGGTTGGGACTATCCTAAGTGTGA3 1 (SEQ ID NO: 10230) 

5 ' TAACACACAACICCATCATCA3 ' (SEQ ID NO: 10231) 

5 • CTAACATGCTTAGGATAATGG3 ■ (SEQ ID NO: 10232) 

25 5 • GCCTCTCTTGTTCTTGCTCGC3 ' (SEQ ID NO: 10233) 

5 ' CAGGTAAGCGTAAAACTCATC3 1 (SEQ ID NO: 10234) 

The invention also includes polynucleotide sequences which can be used as probes for 
diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 

30 or identify the presence or absence of a S ARS virus in a biological sample. The invention 

includes the polynucleotide primers identified in Table 31 (SEQ ID NO s : 10235 to 10258), the 
forward primers SEQ ID NO s : 10259 to 10281 and the reverse primers SEQ ID NO s : 10282 to 
10298. The invention further includes polynucleotide sequences which are complementary to 
any one of these primer sequences disclosed herein. 

35 The invention provides a SARS polynucleotide sequence SEQ ID NO: 10299. All six 

reading frames of this sequence are included in FIGURE 126 (See also Figure 131). Hie 
constituent amino acid sequences from Figure 126, having at least 4 amino acids, are listed as 
SEQ ID NOS: 10300 to 10337. 
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Accordingly, the invention includes a polynucleotide sequence comprising SEQ ID NO: 
10299. It also provides polynucleotide sequences having sequence identity to SEQ ID NO: 
10299. The invention also provides for polynucleotide sequences comprising fragments of SEQ 
ID NO: 10299. In one embodiment, the polynucleotide fragment does not consist entirely of a 
5 known polynucleotide sequence of a SARS virus or a known polynucleotide sequence of a 
coronavirus. 

The invention includes an amino acid sequence encoded by the polynucleotide sequence of 
SEQ ID NO: 10299, including the amino acid sequences shown in Rgure 126, and the amino 
acid sequences selected from the group consisting of SEQ ID NO s : 10300 to 10337. Preferably, 
10 the amino acid sequence comprises SEQ ID NO: 10316. 

The invention also provides amino acid sequences having sequence identity to an amino 
acid sequence encoded by SEQ ID NO: 10299. The invention provides amino acid sequences 
having identity to an amino acid sequence selected from the group consisting of SEQ ID NO s : 
10300 to 10337. 

15 ' The invention also provides fragments of amino acid sequences encoded by SEQ ED NO: 
10299. The invention also provides fragments of amino acid sequences selected from the group 
consisting of SEQ ID NO s : 10300 to 10337. In one embodiment, the fragment does not consist 
entirely of a known amino acid sequence of a SARS virus or a known amino acid sequence of a 
coronavirus. 

20 - Li one embodiment, the invention comprises an amino add sequence comprising SEQ ID ■ 
NO: 10316. Encoded open reading frames within SEQ ID NO: 10316 include SEQ ID NO: 
10338 and SEQ ID NO: 10339. 

In one embodiment, the invention comprises an amino acid sequence comprising a 
sequence from within the 53' Frame 1 translation of SEQ ID NO: 10299. The following encoded 
25 open reading frame is found within this translation: SEQ ID NO: 10340. 

In one embodiment, the invention comprises an amino acid sequence comprising a 
sequence from within the 3'5' Frame 1 translation of SEQ ID NO: 10299. An encoded open 
reading frame within this translation is SEQ ID NO: 10341. 

In one embodiment, the invention comprises an amino acid sequence comprising a 
30 sequence from within the 3'5 f Frame 2 translation of SEQ ID NO: 10299. An encoded open 
reading frame within this translation is SEQ ID NO: 10342. 

The invention includes a polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 10338, SEQ ID NO: 10339, SEQ ID NO: 10340, SEQ ID NO: 
10341 and SEQ ID NO: 10342. The invention includes a polypeptide having sequence identity 
35 to an amino acid sequence selected from the group consisting of SEQ ID NO: 10338, SEQ ID 
NO: 10339, SEQ ID NO: 10340, SEQ ID NO: 10341 and SEQ ID NO: 10342. The invention 
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includes a fragment of a polypeptide comprising an amino acid sequence elected from the group 
consisting of SEQ ID NO: 10338, SEQ ID NO: 10339, SEQ ID NO: 10340, SEQ ID NO: 10341 
and SEQ ID NO: 10342. In one embodiment, the fragment does not consist entirely of a known 
S ARS vims amino acid sequence or of a known coronaviius amino acid sequence. 

In one embodiment, SEQ ID NOS: 10338-10342 are used in fusion proteins. Accordingly, 
the start codon methionines may be removed The invention comprises a amino acid sequence 
selected from the group consisting of SEQ ID NO: 10343, SEQ ID NO: 10344, SEQ ID NO: 
10345, SEQ ID NO: 10346 and SEQ ID NO: 10347. 

In one embodiment, the invention comprises an amino acid sequence selected from the 
group consisting of SEQ ID NO: 10338 and SEQ ID NO: 10339. Partial BLAST results of SEQ 
ID NO: 10338 against GenBank are given below: 

>gi| 133593 |sp|P18457|RRPB_CVPFS RNA-DIRECTED RNA POLYMERASE (ORFlB) 

gi 1 93934 |pir | |A43489 RNA-directed RNA polymerase (EC 2.7.7.48) - porcine 
transmissible gastroenteritis virus (fragment) 

gi|83316l|emb|CAA37284.1| polymerase [Transmissible gastroenteritis^ 

virus] 

Length =533 
Score = 131 bits (329), Expect = 3e-30 

Identities = 55/89 (61%), Positives = 69/89 (77%), Gaps = 1/89 (1%) 

Query: 1 . MLWCKDGHVETFYPKLQASQAW^ PKGIMMN 60 

MLWC++ H+ +TFYP+LQ+ ++ W PG +MP LYK+QRM LE+C+L NYG +P GI N 
Sbjct: 217 MLWCENSHIKTFYPQLQSAE-WNPGYSMPTLYKIQR 215 

Query: 61 VAKYTQLCQYLNTLTLAVPSNMRVTHFGA 89 

V KYTQLCQYLNT TL VP MRV+H GA 
Sbjct: 276 VVKYTQLCQYLNTTTLC^ 304 

These results indicate that SEQ ID NO: 10338 has functional similarities to an RNA- 
directed RNA polymerase of porcine transmissible gastroenteritis virus. 

Partial BLAST results of SEQ ID NO: 10339 against GenBank are given below: 

>gb|AAL57305.l| replicase [bovine coronavirus] 
Length = 7094 

Score = 139 bits (351) , Expect = 7e-33 

Identities = 64/108 (59%), Positives = 78/108 (72%) 

Query: 1 MSVT SKVVKVTIDYAEI SFMLWCKIX^HVETFYPKLQAS QAWQPGVAMPNLYKMQRMLLEK 60 

M+ +SKW V +D+ + FMLWC D V TFYP+LQA+ W+PG +MP LYK +E+ 
Sbjct: 6760 LNCVSKVVNVNVDFKDFQFMLWCNDEKVMTFW 



Query: 61 CDLQNYGENAVIPKGIMMNVAKYTQI£QYL^^ 108 

L NYG+ +P G MMNVAKYTQLCQYLNT TLAVP NMRV+H GA 
Sbjct: 6820 VSLWNYGKFVTLPTGCMMNVAKYTQLC^ 6867 

These results indicate that SEQ ID NO: 10339 has functional similarities to a replicase of 
bovine coronavirus. 
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The SARS virus may contain polymorphism at the Glu-20 residue of SEQ ID NO: 10338. 
The invention includes a polypeptide comprising an amino acid sequence having sequence 
identity to SEQ ID NO: 10338, wherein said polypeptide includes an amino acid sequence 
selected from the group consisting of ASQAW (SEQ ID NO: 10348) and AS RAW (SEQ ID NO: 
5 10349). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 10338, 
wherein said fragment includes an amino acid sequence selected from the group consisting of 
SEQ ID NO: 10348 and SEQ ID NO: 10349. 

Hie SARS vims may contain polymorphism at the Ser-80 residue of SEQ ID NO: 10338. 
below. The invention includes a polypeptide comprising an amino acid sequence having 

10 sequence identity to SEQ ID NO: 10338, wherein said polypeptide includes an amino acid 

sequence selected from the group consisting of VPSNM (SEQ ID NO: 10350) and VPTNM (SEQ 
ID NO: 10351). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
10338, wherein said fragment includes an amino acid sequence selected from the group 
consisting of SEQ ID NO: 10350 and SEQ ID NO: 10351. 

15 The invention also includes polynucleotide sequences which can be used as probes for 

diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify the presence or absence of a SARS virus in a biological sample. The invention 
includes a polynucleotide sequence comprising one or more of the primer sequences identified in 
Table 32. The invention further includes polynucleotide sequence comprising the complement of 

20 one or more of the primer sequences identified in Table 32. 

The invention provides a SARS polynucleotide sequence SEQ ID NO: 10505. All six 
reading frames of this sequence are shown in Figure 127 (see also Figure 132). The constituent 
amino acid sequences from Figure 127, having at least 4 amino acids, are listed as SEQ ID NOS: , 
10506 to 10570. 

25 The invention includes a polynucleotide sequence comprising SEQ ID NO: 10505. The 

invention also provides polynucleotide sequences having sequence identity to SEQ ID NO: 
10505. The invention also provides for polynucleotide sequences comprising fragments of SEQ 
ID NO: 10505. In one embodiment, the polynucleotide fragment does not consist entirely of a 
known SARS virus polynucleotide sequence or of a known coronavirus polynucleotide 

30 sequence. 

The invention includes an amino acid sequence encoded by the polynucleotide sequence of 
SEQ ID NO: 10505, including the amino acid sequences shown in Figure 127, and particularly 
those selected from the group consisting of SEQ ID NO s : 10506 to 10570. Preferably, the amino 
acid sequence comprises SEQ ID NO: 10532 and/or SEQ ID NO: 10533. 
35 The invention also provides amino acid sequences having sequence identity to an amino 

acid sequence encoded by SEQ ID NO: 10505. The invention provides amino acid sequences 
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having sequence identity to an amino acid sequence selected from the group consisting of the 
sequences shown in Figure 127, and in particular SEQ ID NO s : 10506 to 10570. 

The invention also provides fragments of amino acid sequences encoded by SEQ ID NO: 
10505. The invention also provides fragments of amino acid sequences selected from the group 
consisting of SEQ ID NO s : 10506 to 10570. In one embodiment, the fragment does not consist 
entirely of a known amino acid sequence of a SARS virus or a known amino acid sequence of a 
coronavirus. 

In one embodiment, the invention includes a polypeptide comprising an amino acid 
sequence from the 53' Frame 3 of Figure 127.. Some encoded open reading frames within this 
translation are: SEQ ID NO: 10533; SEQ ID NO: 10571; SEQ ID NO: 10572; SEQ ID NO: 
10573; SEQ ID NO: 10574. 

The invention includes a polypeptide sequence comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 10533, SEQ ID NO: 10571, SEQ ID NO: 
10572, SEQ ID NO: 10573 and SEQ ID NO: 10574. The invention includes a polypeptide 
having sequence identity to the amino acid sequence selected from the group consisting of SEQ 
ID NO: 10533, SEQ ID NO: 10571, SEQ ID NO: 10572, SEQ ID NO: 10573 and SEQ ID NO: 
10574. The invention includes a fragment of a polypeptide sequence comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 10533, SEQ ID NO: 10571, SEQ 
ID NO: 10572, SEQ ID NO: 10573 and SEQ ID NO: 10574. 

Partial BLAST results of SEQ H> NO: 10533 against GenBank are given below: 

>gi|7739601|gb|AAP68926.1|AF207902_U nucleocapsid protein [murine 

hepatitis virus strain ML- 11] imurme 
Length = 451 

Score = 147 bits (370), Expect = 3e-34 

Identities = 102/252 (40%), Positives = 137/252 (54%), Gaps = 18/252 (7%) 

Query: 49 SWFTALTQHGK-EELRFPRGQGV^ 106 

SWF+ +TQ K +E +P +GQGVPI + +Q GY+ R RR + DG+ K +L PRW 

Sbuct: 63 SWFSGITQFQKGKEFQFAQGQGVPIASGIPASEQKGYWYRHNRRSPKTPDGQHKQLLPRW 122 

Query: 107 YFYYIXJTGPEASLPYGANKEGr^^ 166 

YPYYLGTGP A YG + EG+VWVA++ A + R+P+++ A + GT LP 

Sbjct: 123 YFYYLGTGPHAGAEYGDDIEGVVWVASQQADTKTTADVVERDPSSHEAIPT^ 182 

Query: 167 KGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPAPJIASGGGOT 226 

+GFY EGS + AS S N SS PA +A L+L +L 

Sbjct: 183 Q^FYVEGSGRSAPASRSGSRSQSRGPNNRARSSSNQRQPASAVKPDMAEEIAALVIAKLG 242 

Query: 227 QLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAPGRRGPEQTQG 282 

OK . . ... + 00 + Q VTK+SA E + KPRQKRT KQ V Q PG+RGP Q 
Sb 3 ct: 243 K DAGQPKQ VTKQSAKEVRQKILTKPRQKRTPNKQCPVQQCPGKRGPNQ 290 

Query: 283 NFGDQDLIRQGT 294 

NFG ++++ GT 
Sbjct: 291 NFGGSEMLKLGT 302 
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>gi|3132999|gb|AAC16422.1| nucleocapsid protein [murine hepatitis virus 
strain 2] 

Length = 451 
Score = 147 bits (370), Expect = 3e-34 

Identities = 102/252 (40%), Positives = 137/252 (54%), Gaps = 18/252 (7%) 

Query: 49 SWFTALTQHOT-EKLRFPRGQC^ 106 

SWF+ +TQ K +E +F +GQGVPI + +Q GY+ R RR + DG+ K+L PRW 

Sbjct: 63 SWFSGITQFQKGKEFQFAQGQGVPIASGI PASEQKGYWYRHNRRSFKTPDGQHKQLLPRW 122 

Query: 107 YFYYLGTGPEASLPYGANKEGrVVfi/AT^ 166 

YFYYLGTGP A YG + EG+VWVA++ A + R+P+++ A + GT LP 

Sbjct: 123 YFYYLGTGPHAGAEYOTDIEGVvWASQQAOT 182 

Query: 167 KGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPAR 226 

+GFY EGS + AS S N SS PA +A L+L +L 

Sbjct: 183 QGFYVEGSGKSAPASRSGSRSQSRGPNNRARSSSNQRQPASAVKPDMAEEIAALV^ 242 

Query: 227 QLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQG 282 

+ GQ +Q VTK+SA E + KPRQKRT KQ V Q FG+RGP Q 
Sbjct: 243 K DAGQPKQ VTKQSAKEVRQKILTKPRQKRT PNKQC PVQQC PGKRGPNQ 290 

Query: 283 NFGDQDLIRQGT 294 

NFG ++++ GT 
Sbjct: 291 NFGGSEMLKLGT 302 



>gi|127877|sp|P03417|NCAP_CVMJH Nucleocapsid protein 
gi j 74859 |pir j |VHIHMJ nucleocapsid protein - murine hepatitis virus 

(strain JHM) 

gi|58973|emb|CAA25497.l| nucleocapsid protein [Murine hepatitis virus] 
Length = 455 

Score = 146 bits (369), Expect = 4e-34 

Identities = 110/254 (43%), Positives = 142/254 (55%), Gaps = 22/254 (8%) 

Query: 49 SWFTALTQHGK-EELRFPRGQGVTINTNSGPDDQIGYYRRA PRW 106 

SWF+ +TQ K +E +P +GQGVPI Q GY+ R RR + DG+ K+L PRW 

Sbjct: 67 SWFSGITQFQKGKEFQFAQGQGVPIANGIPASQQKGYWYRHNRRSFKTPIXSQQKQI^ 126 

Query: 107 YFYYLGTGPEASLPYGANKEGIVWATEGALNTPKDPriG 166 

YFYYLGTGP A YG + EG+VWVA++ A I R+P+++ A + GT LP 

Sbjct: 127 YFYYLGTGPYAGAEYGDDIEGVVWVASQQAETRTSADIVERDPS SHEAI PTRFAPGTVLP 186 

Query: 167 KGFYAEGSRGGSQASSRSSSR — SRGNSRNSTPGSSRGNSPARMASGGGETALALLLLDR 224 

+GFY EGS G S +SRS SR SRG N SS PA +A L+L + 

Sbjct: 187 QGFYVEGS -GRSAPASRSGSRPQSRG-PNNRARS SSNQRQPASTVTCPDMAEEIAALVLAK 244 

Query: 225 LNQLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQ YNVTQAFGRRG PEQT 280 

L + GQ +Q VTK+SA E + KPRQKRT KQ V Q FG+RGP Q 
Sbjct: 245 LGK DAGQPKQ VTKQSAKEVRQKILNKPJRQKRTPNKQC PVQQC FGKRGPNQ - 294 

Query: 281 QGNFGDQDLIRQGT 294 

NFG ++++ GT 
Sbjct: 295 — NFGGPEMLKLGT 306 
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>gi 1 6625766 1 gb| AAF19389 . 1 | AF201929J7 nucleocapsid protein [murine 

hepatitis virus strain 2] 

gi 1 7769348 1 gb| AAF69338 . 1 | AF208066_11 nucleocapsid protein [murine 

hepatitis virus] 

Length = 451 

Score = 146 bits (368), Expect = 5e-34 

Identities = 102/252 (40%), Positives « 137/252 (54%), Gaps = 18/252 (7%) 



Query: 


49 


SWFTMjTQHGK-EELRFPRCXJGVPIOTTO^ 


106 






SWP+ +TQ K +E +P +GQGVPI + +Q GY+ R RR + DG+ K+L PRW 




Sbjct : 


63 


SWFSGITQFQKGKEFQFAQGQGWIASGI PASEQKGYWYRHNRRSFKTPDGQHKQIi 


122 


Query: 


107 


YFYYLGTGPEASLPYGANKEGIVWVM 


166 






YFYYLGTGP A YG + EG+VWVA++ A + R+P+++ A + GT LP 




Sbjct: 


123 


YFYYLGTGPHAGAEYGDDIEGVVWASQQAIXrKT^ 


182 


Query: 


167 


KGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGOTALALLLI^ 


226 






+GPY EGS + AS S N SS PA +A L+L +L 




Sbjct: 


183 


QGFYVK3SGRSAPASRSGSRSQSRGPNNRARSSSNQRQPASAVTOT 


242 


Query: 


227 


QLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAPGRRGPEQTQG 


282 






+ GQ +Q VTK+SA E + KPRQKRT KQ V Q PG+RGP Q 




Sbjct: 


243 


K DAGQPKQ VTKQSAKEVRQKILTKPRQKRTPNKQCPVQQCFGKRGPNQ- — 


29a 


Query: 


283 


NFGDQDLIRQGT 294 








NFG ++++ GT 




Sbjct: 


291 


NFGGSEMLKLGT 302 





>gi|21734854 | gb| AAM77005. 1 1 AF481863_7 phosphorylated nucleocapsid- protein 
N [porcine hemagglutinating encephalomyelitis virus] 
Length = 449 

Score = 145 bits (366), Expect = 8e-34 

Identities = 107/253 (42%), Positives = 145/253 (57%), Gaps = 18/253 (7%) ' 



Query: 


49 


SWFTALTQHGK- EELRFPRGQG\n?IOTTCSGPDDQIGYY^ 


106 






SWF+ +TQ K +E F GQGVPI + GY+ R RR + DG ++L PRW 




Sbjct: 


64 


SWF SGITQFQKGKEFEFAEGQGVPIAPGVPATEAKGYWYRHNR 


123 


Query: 


107 


YFYYLGTGPEASLPYGANK^rVWATEGA-LNT^ 


165 






YFYYLGTGP A YG + +G+ WVA+ A +NTP D I R+P+++ A + P GT L 




Sbjct: 


124 


YFYYLGTGPHAKHQYGTDIDGVFWVASNQADIOT 


182 


Query: 


166 


PKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETALALIJ^^ 


225 






P+G+Y EGS G S +SRS+SR+ N S SR NS R ++ G +A D++ 




Sbjct: 


183 


PQGYYIEGS -GRSAPNSRSTSRA-PNRAPSAGSRSRANSGNRTSTPGVTPDMA DQI 


236 


Query: 


226 


NQLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQ 


281 






L GK + Q VTK++A E + KPRQKR+ KQ V Q FG+RGP Q 




Sbjct: 


237 


ASLVLAKLGK-DATKPQQVTKQTAKE^ - 


293 


Query: 


282 


GNFGDQDLIRQGT 294 








NFG ++++ GT . 




Sbjct: 


294 


-NFGGGEMLKLGT 305 
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>gi| 23295765 |gb|AAL80036.1| nucleocapsid protein [porcine 

nonagglutinating encephalomyelitis virus] 
Length = 449 

5 Score = 145 bits (365), Expect = le-33 

Identities = 107/253 (42%), Positives = 145/253 (57%), Gaps = 18/253 (7%) 

Query: 49 SWFTAI»TQHGK- KKLRFPRGQGVP Il^n^SGPDDQlGYYRI^TRR-VllGGDGKMKKLS PIW 106 
SWP+ +TQ K +E F GQGVPI + GY+ R RR + DG ++L PRW 

10 Sbjct: 64 SWFSGITQFQKGKEFEFABGQGVTIAPGVPSTEA^ 123 

Query: 107 YFYYLGTGPEASLPYGANKEGIVWATEGA-I^ 165 

YFYYLGTGP A YG + +G+ WVA+ A +NTP D I R+P+++ A + P GT L 
Sbjct: 124 YFYYLGTGPHAKTOYCT^IDGVFWVASNQADINTPAD- IVDRDPS SDEAIPTRFPPGTVL 182 



15 



Query: 166 PKGFYATCSRGGSQASSRSS SRSRGNSKNSTPGSSRGNSPA^ 225 

P+G+Y EGS G S +SRS+SR+ N S ' SR NS * R ++ G +A D++ 
Sbjct: 183 PQGYYIEGS -GRSAPNSRSTSRA- PNRAPSAGSRSRANSGNRTSTPGVTPDMA DQI 236 



20 Query: 226 NQLESKVSGKGQQQQGQTVTKKSAAEASK KPRQKRTATKQYNVTQAFGRRGPEQTQ 281 

h GK + Q VTK++A E + KPRQKR+ KQ V Q FG+RGP Q 

Sbjct: 237 ASLVLAKLGK-DATKPQQVTKQTAKEVRQKILNKPRQKRS PNKQCTVQQCFGKRGPNQ — 293 

Query: 282 GNFGDQDLIRQGT 294 
25 NFG ++++ GT 

' Sbjct: 294 -NFGGGEMLKLGT 305 

These results indicate that SEQ ID NO: 10533 has functional similarities to a coronavirus 
nucleocapsid protein. 

30 In one embodiment, the invention comprises an amino acid sequence from the 53' Frame 1 

of Figure 127 e.g. SEQ ID NO s : 10506-10514. Some encoded open reading frames within this 
region are SEQ ID NO s : 10575 to 10578. 

Accordingly, the invention includes a polypeptide comprising the amino acid sequence 
selected from the group consisting of SEQ ID NO: 10575, SEQ ID NO: 10576, SEQ ID NO: 

35 10577 and SEQ ID NO: 10578. The invention includes a polypeptide comprising an amino acid 
sequence having sequence identity to a sequence selected from the group consisting of SEQ ID 
NO: 10097, SEQ ID NO: 10576, SEQ ID NO: 10577 and SEQ ID NO: 10578. The invention 
includes a fragment of a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 10097, SEQ ID NO: 10576, SEQ ID NO: 10577 and SEQ ID NO: . 

40 10578. 

In one embodiment, the invention includes a polypeptide comprising an amino acid 
sequence from the 3'5 f Frame 2 of Hgure 127 e.g. SEQ ID NO s : 10547-10559. An open reading 
frame within this region is SEQ ID NO: 10579. 

The invention includes a polypeptide comprising an amino acid sequence of SEQ ID NO: 
45 10579. The invention includes a polypeptide comprising an amino acid sequence having 

sequence identity to SEQ ID NO: 10579. The invention includes a fragment of a polypeptide 
comprising an amino acid sequence of SEQ ID NO: 10579. 
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Hie invention also includes polynucleotide sequences which can be used as probes for 
diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify the presence or absence of a SARS virus in a biological sample. The invention 
includes a polynucleotide sequence comprising one or more of the primer sequences identified in 
5 Table 33. The invention further includes polynucleotide sequence comprising the complement 
of one or more of the primer sequences identified in Table 33. 

The invention includes a polynucleotide sequence comprising SEQ ID NO: 1 1323. A 
polypeptide encoded by SEQ ID NO: 11323 is SEQ ID NO: 11324. 

The invention includes a polypeptide comprising SEQ ID NO: 1 1324, sequence having 
10 sequence identity to SEQ ID NO: 1 1324 and fragments of SEQ ID NO: 1 1324. The invention 
includes a fragment of SEQ ID NO: 1 1324, wherein said polypeptide fragment begins with a 
Methionine. 

Accordingly, the invention includes a polynucleotide sequence comprising SEQ ID NO: 
11323. It also provides polynucleotide sequences having sequence identity to SEQ ID NO: 
15 1 1323. The invention also provides for polynucleotide sequences comprising fragments of SEQ 
ID NO: 1 1323. In one embodiment, the polynucleotide fragment does not consist entirely of a 
known S ARS polynucleotide sequence or a known coronavirus polynucleotide sequence. 

The invention includes an amino acid sequence encoded by the polynucleotide sequence 
SEQ ID NO: 11323, including the amino acid sequence of SEQ ID NO: 11324. 
20 The invention also provides amino acid sequences having sequence identity to an amino 

acid sequence encoded by SEQ ID NO: 11323. The invention provides amino acid sequences 
having sequence identity to SEQ ID NO: 1 1324. 

The invention provides fragments of amino acid sequences encoded by SEQ ID NO: 
11323. The invention also provides fragments of amino acid sequences of SEQ ID NO: 11324. 
25 In one embodiment, the fragment does not consist entirely of a known S ARS amino acid 
sequence or a known coronavirus amino acid sequence. 

The invention also includes polynucleotide sequences which can be used as probes for 
diagnostic reagents, kits (comprising such reagents) and methods which can be used to diagnose 
or identify the presence or absence of a SARS virus in a biological sample. The invention 
30 includes a polynucleotide sequence comprising one or more of the primer sequences identified as 
SEQIDNO s : 11325-11440 (left part) and SEQ ID NO s : 11441-11551 (right part). The invention 
further includes polynucleotide sequence comprising the complement of one or more of the 
primer sequences identified as SEQ ID NO s : 11325-1 1551. 

The invention includes a polypeptide comprising SEQ ID NO: 11552. The SARS virus 
35 contains polymorphism at the Isoleucine residue He-324. The invention includes a polypeptide 
comprising an amino acid sequence having sequence identity to SEQ ID NO: 11552, wherein 
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said polypeptide includes an amino acid sequence selected from the group consisting of YSYAI 
(SEQ ID NO: 1 1553), SYAIH (SEQ ID NO: 11554), YAIHH (SEQ ID NO: 11555), IHHDK 
(SEQ ID NO: 11556), SYAI (SEQ ID NO: 11557), YAIH (SEQ ID NO: 11558), AIHH (SEQ ID 
NO: 1 1559), IHHD (SEQ ID NO: 11560), YAI, AIH, and IHH. The invention includes a 
5 fragment of a polypeptide comprising SEQ ID NO: 11552, wherein said fragment includes an 
amino acid sequence selected from the group consisting of YSYAI (SEQ ID NO: 11553), 
SYAIH (SEQ ID NO: 11554), YAIHH (SEQ ID NO: 11555), IHHDK (SEQ ID NO: 11556), 
SYAI (SEQ ID NO: 11557), YAIH (SEQ ID NO: 11558), AIHH (SEQ ID NO: 11559), IHHD 
(SEQ ID NO: 11560), YAI, AIH, and IHH. 

The invention includes a polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 11561 and SEQ ID NO: 11562. The invention includes a 
fragment of a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 11561 and SEQ ID NO: 11562. 

The invention includes a diagnostic kit comprising a polypeptide comprising at least one of 
the amino acid sequences selected from the group consisting of SEQ ID NO s : 1 1561 and 1 1562. 
Hie invention includes a diagnostic kit comprising a polynucleotide sequence encoding a 
polypeptide comprising at least one of the amino acid sequences selected from the group • 
consisting of SEQ ID NO s : 11561 and 11562. The invention includes an immunogenic 
composition comprising a polypeptide comprising at least one of the amino acid sequences 
selected from the group consisting of SEQ ID NO s : 1 1561 and 1 1562. The invention includes 
antibody which recognizes a polypeptide comprising at least one of the amino acid sequences 
selected from the group consisting of SEQ ID NO s : 1 1561 and 11562. 

The invention includes a polynucleotide sequence SEQ ID NO: 1 1563 or a fragment 
thereof or a sequence having sequence identity thereto. Polypeptide sequences which can be 
translated from SEQ ID NO: 1 1563 are shown in Figure 128. The constituent amino acid 
sequences from Figure 128, having at least 4 amino acids, are listed as SEQ ID NO s : 11564 to 
11617. 

The invention includes a polypeptide sequence selected from the group consisting of the 
sequences of Figure 128, or a fragment thereof or a sequence having sequence identity thereto 
e.g. SEQ ID NO s : 11563 to 11617. 

A polypeptide sequence within SEQ ID NO: 11600 is SEQ ID NO: 11618. The invention 
includes a polypeptide comprising SEQ ID NO: 11618, or a fragment thereof or a sequence 
having sequence identity thereto. 
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A polypeptide sequence within SEQ ID NO: 11602 is SEQ ID NO: 11641. Hie invention 
includes a polypeptide comprising SEQ ID NO: 11641, or a fragment thereof or a sequence 
having sequence identity thereto. 

A polypeptide sequence within SEQ ID NO: 11609 is SEQ ID NO: 11619. 
5 Hie invention includes a polynucleotide encoding (i) an amino acid sequence selected from 

the group consisting of: (1) the amino acid sequences of Figure 128, and in particular SEQ ID 
NO s : 11564-11617; (2) SEQ ID NO: 11618; and (3) SEQ ID NO: 11619, or (ii) a fragment 
thereof. The invention includes a diagnostic kit comprising a one or more of these proteins. The 
invention includes a diagnostic kit comprising a polynucleotide sequence encoding one or more 

10 of these polypeptide sequences. The invention includes an antibody which recognizes one or 
more of the polypeptide sequences. 

The SARS virus may contain polymorphism at isoleucine residue He-326 in SEQ ID NO: 
11620 (CM-PEP3). The invention includes a polypeptide comprising an amino acid sequence 
having sequence identity to SEQ ID NO: 11620, wherein said polypeptide includes an amino 

15 acid sequence selected from the group consisting of YAXHH (SEQ ID NO: 1 1621) and YATHH 
(SEQ ID NO: 11622). The invention includes a fragment of a polypeptide comprising SEQ ID 
NO: 11620, wherein said fragment includes an amino acid sequence selected from the group 
consisting of YAJHH (SEQ ID NO: 1 1621) and YATHH (SEQ ID NO: 1 1622). 

The SARS virus may contain polymorphism at gjutamine residue Gln-830 in SEQ ID NO: 

20 11620. The invention includes a polypeptide comprising an amino acid sequence having 
sequence identity to SEQ ID NO: 1 1620, wherein said polypeptide includes an amino acid 
sequence selected from the group consisting of ASQAW (SEQ ID NO: 1 1623) and ASRAW (SEQ 
ID NO: 1 1624). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
11620, wherein said fragment includes an amino acid sequence selected from the group 

25 consisting of ASQAW (SEQ ID NO: 11623) and ASRAW (SEQ ID NO: 11624). 

The SARS virus may contain polymorphism at aspartic acid residue Asp-935 in SEQ ID 
NO: 11620. The invention includes a polypeptide comprising an amino acid sequence having 
sequence identity to SEQ ID NO: 11620, wherein said polypeptide includes an amino acid 
sequence selected from the group consisting of DADST (SEQ ID NO: 1 1625) and DAYST (SEQ 

30 ID NO: 1 1626). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
1 1620, wherein said fragment includes an amino acid sequence selected from the group 
consisting of DADST (SEQ ID NO: 11625) and DAYST (SEQ ID NO: 11626). 

The SARS virus may contain polymorphism at serine residue Ser-577 in SEQ ID NO: 
11627 (Chi-PEP4). Hie invention includes a polypeptide comprising an amino acid sequence 

35 having sequence identity to SEQ ID NO: 11627, wherein said polypeptide includes an amino 
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acid sequence selected from the group consisting of PCSFG (SEQ ED NO: 11628) and PCAFG 
(SEQ ID NO: 1 1629). The invention includes a fragment of a polypeptide comprising SEQ ID . 
NO: 11627, wherein said fragment includes an amino acid sequence selected from the group 
consisting of PCSFG (SEQ ID NO: 11628) and PCAFG (SEQ ID NO: 11629). 
5 Hie SARS virus may contain polymorphism at valine residue Val-68 in SEQ ID NO: 

11630 (Chi-PEP8). The invention includes a polypeptide comprising an amino acid sequence 
having sequence identity to SEQ ID NO: 1 1630, wherein said polypeptide includes an amino 
acid sequence selected from the group consisting of LAWY (SEQ ID NO: 11631) and LAAVY 
(SEQ ID NO: 1 1632). The invention includes a fragment of a polypeptide comprising SEQ ID 

10 NO: 11630, wherein said fragment includes an amino acid sequence selected from the group 
consisting of LAWY (SEQ ID NO: 11631) and LAAVY (SEQ ID NO: 11632). 

The SARS virus may contain polymorphism at isoleucine residue De-50 in SEQ ID NO: 
11633 (Chi-PEP13). The invention includes a polypeptide comprising an amino acid sequence 
having sequence identity to SEQ ID NO: 11633, wherein said polypeptide includes an amino 

15 acid sequence selected from the group consisting of NNTAS (SEQ ID NO: 1 1634) and NNTAS 
(SEQ ID NO: 11635). The invention includes a fragment of a polypeptide comprising SEQ ID 
NO: 11633, wherein said fragment includes an amino acid sequence selected from the group 
consisting of NNXAS (SEQ ID NO: 11634) and NNTAS (SEQ ID NO: 11635). 

The SARS virus may contain a polymorphism at Serine residue Ser-943 in SEQ ID NO: 

20 1 1636. The invention includes a polypeptide comprising an amino acid sequence having 
sequence identity to SEQ ID NO: 11636, wherein said polypeptide includes an amino acid 
sequence selected from the group consisting of AVSAC (SEQ ID NO: 1 1637) and AVGAC (SEQ 
ID NO: 11638). The invention includes a fragment of a polypeptide comprising SEQ ID NO: 
11636, wherein said fragment includes an amino acid seuence selected from the group consisting 

25 of AVSAC (SEQ ID NO: 11637) and AVGAC (SEQ ID NO: 11638). 

The invention includes a polynucleotide SEQ ID NO: 11639, or a fragment thereof or a 
sequence having sequence identity thereto. The invention includes a polypeptide encoded by the 
polynucleotide sequence set forth in SEQ ID NO: 11639, or a fragment thereof or a polypeptide 
sequence having sequence identity thereto. 

$0 The invention includes a polynucleotide set forth in SEQ ID NO: 11640, or a fragment 

thereof or a sequence having sequence identity thereto. The invention includes a polypeptide 
encoded by the polynucleotide sequence set forth in SEQ ID NO: 1 1640, or a fragment thereof or 
a polypeptide sequence having sequence identity thereto. 

The invention includes each of the polynucleotides identified above. The invention 

15 includes each of the polynucleotides set forth in the sequence listing. The invention further 
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includes polynucleotides having sequence identity to each of the polynucleotides identified 
above. The degree of sequence identity is preferably greater than 50% (e.g., 60%, 61%, 62%, 
63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 
79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 
5 95%, 96%, 97%, 98%, 99% or more). 

The invention includes polynucleotide sequences comprising fragments of each of the 
polynucleotide sequences identified above. The fragments should comprise at least n 
consecutive polynucleotides from a particular SEQ ID NO:, and, depending on the sequence, n is 
7 or more {e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 

10 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 70, 80, 90, 100, 120, 130, 140, 
150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300 or more). 

. The invention includes each of the amino acid sequences encoded by each of the 
polynucleotide sequences identified above. The invention includes each of the amino acid 
sequences encoded by each of the polynucleotide sequences set forth in the sequence listing. 

15 The invention further includes amino acid sequences having sequence identity to the amino acid 
sequences encoded by each of the polynucleotide sequences identified above. The degree of 
sequence identity is preferably greater than 50% (e.g., 60%, 61%, 62%, 63%, 64%, 65%, 66%, 
67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 
83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 

20 99% or more). The invention further includes fragments of amino acid sequences encoded by 
each of the polynucleotide sequences identified above. Hie fragments should comprise at least n 
consecutive amino acids from a particular SEQ ID NO:, and, depending on the sequence, n is 7 
or more (e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 
30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 70, 80, 90, 100, 120, 130, 140, 150, 

25 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300 or more). 

The invention includes each of the amino acid sequences identified above. Hie invention 
includes each of the amino acid sequence set forth in the sequence listing. The invention further 
includes amino acid sequences having sequence identity to each of the amino acid sequences 
identified above. The degree of sequence identity is preferably greater than 50% (e.g., 60%, 

30 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 
77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, 99% or more). 

The invention further includes fragments of the amino acid sequences identified above. 
The fragments should comprise at least n consecutive amino acids from a particular SEQ ID 

35 NO:, and, depending on the sequence, n is 7 or more (e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 
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60, 65, 70, 80, 90, 100, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 
260, 270, 280, 290, 300 or more). 

The invention includes polynucleotides encoding each of the amino acid sequences 
identified above. Hie invention includes polynucleotides encoding each of the amino acid 
5 sequences set forth in the sequence listing. The invention further includes polynucleotides 
having sequence identity with each of the polynucleotides encoding each of the amino acid 
sequences identified above. The degree of sequence identity is preferably greater than 50% (e.g., 
60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 
76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 

10 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more). 

The invention further includes fragments of polynucleotides encoding each of the amino 
arid sequences identified above. The fragments should comprise at least n consecutive 
polynucleotides from a particular SEQ ID NO:, and, depending on the sequence, n is 7 or more 
(e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 1*U7, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 

15 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 70, 80, 90, 100, 120, 130, 140, 150, 160, 
170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300 or more). 

As described in more detail below, polynucleotides for use as primers and/or as probes 
may contain at least 4 or 8 contiguous nucleotides from a polynucleotide sequence of the 
invention e.g. at least 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 

20 contiguous nucleotides and up to about 50, 75, 100, 200 contiguous nucleotides or more. While 
6-8 nucleotides may be a workable length, sequences of 10-12 nucleotides are preferred; and 
about 13, 14, 15, 16, 17, 18, 19, 20, or 21 or more nucleotides or more appears optimal for 
hybridisation. 

In one embodiment, the invention is directed to polynucleotides and amino acid sequences 
15 that do not consist entirely of a known S ARS virus polynucleotide or amino arid sequenbe or of 
a known coronavirus polynucleotide or amino acid sequence. In one embodiment, the 
polynucleotides and amino arid sequences of the invention do not consist entirely of the 
sequence SEQ ID NO: 1. In another embodiment, the polynucleotides and amino arid sequences 
of the invention do not consist entirely of the sequence SEQ ID NO: 2. SEQ ID NO: 9967 is a 
JO SARS genome sequence of the Frankfurt (ERA) isolate (GenBank AY310120). Compared to 
SEQ ID NO: 1, it differs at nucleotides 2546, 2590, 11437, 18954, 19073, 20585, 20899, 23209, 
24922, 26589 & 28257; compared to SEQ ID NO:2, it differs at nucleotides 2560, 7922, 11451, 
16625, 18968 & 19067. Further genome sequences have become available from GenBank, since 
this application was originally filed, under accession numbers including AY559097, AY559096, 
6 AY559095, AY559094, AY559093, AY559092, AY559091, AY559090, AY559089, AY559088, 
AY559087, AY559086, AY559085, AY559084, AY559083, AY559082, AY559081, AY274U9, 
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AY323977, AY291315, AY502932, AY502931, AY5Q2930, AY502929, AY502928, AY502927, 
AY502926, AY502925, AY502924, AY502923, AY291451, AY390556, AY395003, AY395002, 
AY395001, AY395000, AY394999, AY394998, AY394997, AY394996, AY394995, AY394994, 
AY394993, AY394992, AY394991, AY394990, AY394989, AY394987, AY394986, AY394985, 
5 AY394983, AY394979, AY394978, AY508724, AY394850, AY463059, AY463060, AY313906, 
AY310120, AY461660, AY485278, AY485277, AY345988, AY345987, AY345986, AY282752, 
AY357076, AY357075, AY350750, AY304495, AY304488, AY304486, AY427439, AY283798, 
AY278491, AY278489, AY362699, AY362698, AY283797, AY283796, AY283795, AY283794, 
AY278741, AY351680, AP006561, APG06560, AP006559, AP006558, AP006557, AY278554, 

10 AY348314, AY338175, AY338174, AY321118, AY279354, AY278490, AY278487, AY297028, 
AY278488, and NCJ)04718. 

In another embodiment, the invention is directed to polynucleotides that encode proteins 
which are not immunologically cross reactive with a protein of a mouse hepatitis virus, a bovine 
corona virus or an avian infectious bronchitis virus. In another embodiment, the invention is 

15 directed to proteins which are not immunologically cross reactive with a protein of a mouse 
hepatitis virus, a bovine coronavirus or an avian infectious bronchitis virus. 

' . Each of the polynucleotides identified above may be used to encode a portion of a fusion 
protein. Accordingly, the invention complies one or more of the polynucleotides identified 
above wherein the polynucleotides encoding for the start codon are removed. The invention 

20 further comprises one or more of the amino acids identified above wherein the starting 
methionine is removed. 

Any of the polynucleotide or amino acid sequences discussed above may be used in 
vaccines for the treatment or prevention of SARS virus infection, including as a S ARS viral 
antigen. Additionally, any of the polynucleotides or amino acid sequences discussed above may 

25 be used as diagnostic reagents, or in kits (comprising such reagents) or in methods used to 
diagnose or identify the presence or absence of a SARS virus in a biological sample. 

SARS viral antigens of the invention may include a polypeptide with 99%, 95%, 90%, 
85%, or 80% homology to one or more of the group consisting of the following proteins: 
nonstructural protein 2 (NS2); hemagglutinin-esterase glycoprotein (HE) (also referred to as E3), 

30 spike glycoprotein (S) (also referred to as E2), nonstructural region 4 (NS4), envelope (small 
membrane) protein (E) (also referred to as sM), membrane glycoprotein (M) (also referred to as 
El), nucleocapsid phosphoprotein (N) or RNA dependent RNA polymerase (pol). 

A detailed discussion of Coroavirus biology can be found in Fields Virology (2nd ed), 
Relds et aL (eds.), B.N. Raven Press, New York, NY., Chapter 35. 

35 Another example of a SARS virus isolate is set forth in Example 1 below. The invention 

includes each of the polypeptide and polynucleotide sequences identified in Example 1. In 
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addition, the invention includes vaccine formulations comprising one or more of the polypeptide 
or polynucleotide sequences identified in Example 1. Hie invention includes diagnostic 
regaents, kits (comprising such reagents) and methods which can be used to diagnose or identify 
the presence or absence of a SARS virus in a biological sample using one or more of the 
5 polypeptide or polynucleotide sequences identified in Example 1. The invention includes 
methods for the treatment or prevention of SARS virus infection utilizing small molecule viral 
inhibitors and combinations of small molecule viral inhibitors and kits for the treatment of 
SARS. The small molecule inhibitors may specifically target one or more of the polypeptides or 
polynucleotides identified in Example 1. 

10 Further discussion of terms used in die application follows below. 

"Respiratory Virus" as used herein refers to a virus capable of infecting the human 
respiratory tract Respiratory Viral Antigens suitable for use in the invention include Severe 
Acute Respiratory Syndrome virus, coronavirus, influenza virus, human rhinovirus (HRV), 
parainfluenza virus (PIV), respiratory syncytial virus (RSV), adenovirus, metapneumovirus, and 

15 rhinovirus. 

The terms "polypeptide", "protein" and "amino acid sequence" as used herein generally 
refer to a polymer of amino acid residues and are not limited to a minimum length of the product 
Thus, peptides, oUgopeptides,idimers, mulimers, and the like, are included within the definition. 
Both full-length proteins and fragments thereof are encompassed by the definition. Minimum 

20 fragments of polypeptides useful in the invention can be at least 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 
14, or even 15 amino acids. Typically, polypeptides useful in this invention can have a 
maximum length suitable for the intended application. Generally, the maximum length<is not 
critical and can easily be selected by one skilled in the art 

Polypeptides of the invention can be prepared in many ways e.g. by chemical synthesis (at 

25 least in part), by digesting longer polypeptides using proteases, by translation from RNA, by 
purification from cell culture (e.g. from recombinant expression), from the organism itself (e.g. 
after viral culture, or direct from patients), from a cell line source etc. A preferred method for 
production of peptides <40 amino acids long involves in vitro chemical synthesis (Bodanszky 
(1993) Principles of Peptide Synthesis (ISBN: 0387564314); Fields et aL (1997) Methods in 

30 Enzymology 289: Solid-Phase Peptide Synthesis. ISBN: 0121821900). Solid-phase peptide 
synthesis is particularly preferred, such as methods based on t-Boc or Fmoc (Chan & White 
(2000) Fmoc Solid Phase Peptide Synthesis ISBN: 0199637245) chemistry. Enzymatic synthesis 
(Kullmann (1987) Enzymatic Peptide Synthesis. ISBN: 0849368413) may also be used in part or 
in full. As an alternative to chemical synthesis, biological synthesis may be used e.g. the 

*5 polypeptides may be produced by translation. This may be carried out in vitro or in vivo. 

Biological methods are in general restricted to the production of polypeptides based on L-amino 
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acids, but manipulation of translation machinery (e.g. of aminoacyi tRNA molecules) can be 
used to allow the introduction of D-amino acids (or of other non natural amino acids, such as 
iodotyrosine or methylphenylalanine, azidohomoalanine, etc.) (Ibba (1996) Biotechnol Genet 
Eng Rev 13:197-216.). Where D-amino acids are included, however, it is preferred to use 
5 chemical synthesis. Polypeptides of the invention may have covalent modifications at the 
C-terminus and/or N-terminus, particularly where they are for in vivo administration e.g by 
attachment of acetyl or carboxamide, as in the Fuzeon™ product 

Reference to polypeptides and the like also includes derivatives of the amino acid 
sequences of the invention. Such derivatives can include postexpression modifications of the 

10 polypeptide, for example, glycosylation, acetylation, phosphorylation, and the like. Amino acid 
derivatives can also include modifications to the native sequence, such as deletions, additions 
and substitutions (generally conservative in nature), so long as the protein maintains the desired 
activity. These modifications may be deliberate, as through site-directed mutagenesis, or may be 
accidental, such as through mutations of hosts which produce the proteins or errors due to PCR 

15 amplification. Furthermore, modifications may be made that have one or more of the following 
effects: reducing toxicity; facilitating cell processing (e.g., secretion, antigen presentation, etc.); 
and facilitating presentation to B-cells and/or T-cells. 

"Fragment" or "Portion" as used herein refers to a polypeptide consisting of only a part of 
the intact full-length polypeptide sequence and structure as found in nature. For instance, a 

10 fragment can include a C-terminal deletion and/or an N-terminal deletion of a protein. , 
A "recombinant" protein is a protein which has been prepared by recombinant DNA 
techniques as described herein. In general, the gene of interest is cloned and then expressed in 
transformed organisms, as described further below. Hie host organism expressed the foreign 
gene to produce the protein under expression conditions. 

5 The term "polynucleotide", as known in the art, generally refers to a nucleic acid molecule. 

A "polynucleotide" can include both double- and single-stranded sequences and refers to, but is 
not limited to, cDNA from viral, prokaryotic or eukaryotic mRNA, genomic RNA and DNA 
sequences from viral (e.g. RNA and DNA viruses and retroviruses) or prokaryotic DNA, and 
especially synthetic DNA sequences. The term also captures sequences that include any of the 

0 known base analogs of DNA and RNA, and includes modifications such as deletions, additions 
and substitutions (generally conservative in nature), to the native sequence, so long as the nucleic 
acid molecule encodes a therapeutic or antigenic protein. These modifications may be 
deliberate, as through site-directed mutagenesis, or may be accidental, such as through mutations 
of hosts that produce the antigens. Modifications of polynucleotides may have any number of 

5 effects including, for example, facilitating expression of the polypeptide product in a host cell. 
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Polynucleotides of the invention may be prepared in many ways e.g. by chemical synthesis 
(e.g. phosphoramidite synthesis of DNA) in whole or in part, by digesting longer nucleic acids 
using nucleases (e.g. restriction enzymes), by joining shorter nucleic acids or nucleotides (e.g. 
using ligases or polymerases), from genomic or cDNA libraries, etc. 
5 A polynucleotide can encode a biologically active (e.g. , immunogenic or therapeutic) 

protein or polypeptide. Depending on the nature of the polypeptide encoded by the 
polynucleotide, a polynucleotide can include as little as 10 nucleotides, e.g., where the 
polynucleotide encodes an antigen. 

By "isolated" is meant, when referring to a polynucleotide or a polypeptide, that the 

10 indicated molecule is separate and discrete from the whole organism with which the molecule is 
found in nature or, when the polynucleotide or polypeptide is not found in nature, is sufficiently 
free of other biological macromolecules so that the polynucleotide or polypeptide can be used for 
its intended purpose. The polynucleotides and polypeptides of the invention are preferably 
isolated polynucleotides and isolated polypeptides. 

15 "Antibody*' as known in the art includes one or more biological moieties that, through 

chemical or physical means, can bind to or associate with an epitope of a polypeptide of interest 
The antibodies of the invention include antibodies which specifically bind to a SARS viral 
antigen. The term "antibody" includes antibodies obtained from both polyclonal and monoclonal 
preparations, as well as the following: hybrid (chimeric) antibody molecules (see, for example, 

>0 Winter et al. (1991) Nature 349: 293-299; and US Patent No. 4,816,567; F(ab')2 and F(ab) 
fragments; F v molecules (non-covalent heterodimers, see, for example, Inbar et al (1972) Proc 
Natl Acad Sci USA 69:2659-2662; and Ehrlich et al. (1980) Biochem 19:4091-4096); single- 
chain Fv molecules (sFv) (see, for.example, Huston et al. (1988) Pfoc Natl Acad Sci USA 
85:5897-5883); dimeric and trimeric antibody fragment constructs; minibodies (see, e.g., Pack 

15 et aL (1992) Biochem 31:1579-1584; Cumber et al. (1992) J Immunology 149B : 120-126); 

humanized antibody molecules (see, for example, Riechmann et aL (1988) Nature 332:323-327; 
Verhoeyan et al. (1988) Science 239:1534-1536; and U.KL Patent Publication No. GB 2,276,169, 
published 21 September 1994); and, any functional fragments obtained from such molecules, 
wherein such fragments retain immunological binding properties of the parent antibody 

»0 molecule. The term "antibody* * further includes antibodies obtained through non-conventional 
processes, such as phage display. 

As used herein, the term "monoclonal antibody" refers to an antibody composition having 
a homogeneous antibody population. The term is not limited regarding the species or source of 
the antibody, nor is it intended to be limited by the manner in which it is made. Thus, the term 

5 encompasses antibodies obtained from murine hybridomas, as well as human monoclonal 



-102- 



WO 2004/092360 



PCT/US2004/011710 



antibodies obtained using human rather than murine hybridomas. See, e.g., Cote, et aL 
Monoclonal Antibodies and Cancer Therapy, Alan R. liss, 1985, p 77. 

An "immunogenic composition" as used herein refers to a composition that comprises an 
antigenic molecule where administration of the composition to a subject results in the 
5 development in the subject of a humoral and/or a cellular immune response to the antigenic 
molecule of interest Hie immunogenic composition can be introduced directly into a recipient 
subject, such as by injection, inhalation; oral, intranasal or any other parenteral, mucosal or 
transdermal {e.g., intra-rectally or intra-vaginally) route of administration. 

Hie term "derived from" is used to identify the source of molecule {e.g., a molecule can be 

10 derived from a polynucleotide, polypeptide, an immortalized cell line can be derived from any 
tissue, etc.). A first polynucleotide is "derived from" a second polynucleotide if it has the same 
or substantially the same basepair sequence as a region of the second polynucleotide, its cDNA, 
complements thereof, or if it displays sequence identity as described above. Thus, a first 
polynucleotide sequence is "derived from" a second sequence if it has (i) the same or * 

L5 substantially the same sequence as the second sequence or (ii) displays sequence identity to 
polypeptides of that sequence. 

A first polypeptide is "derived from" a second polypeptide if it is (i) encoded by a first 
polynucleotide derived from a second polynucleotide, or (ii) displays sequence identity to the 
second polypeptides as described above. Thus, a polypeptide (protein) is "derived from" a 

iO particular S ARS virus if it is (i) encoded by an open reading frame of a polynucleotide of that 

SARS virus, or (ii) displays sequence identity, as described above, to polypeptides of that SARS - 
virus. 

Both polynucleotide and polypeptide molecules can be physically derived from a SARS 
virus or produced recombinantly or synthetically, for example, based on known sequences. 

55 A cultured cell or cell line is "derived from" another cell, cells or tissue if it is originally 

obtained from existing cells or tissue. Non-limiting examples of tissue that cells may be derived 
from include skin, retina, liver, kidney, heart, brain, muscle, intestinal, ovary, breast, prostate, 
cancerous tissue, tissue infected with one or more pathogens (e.g., viruses, bacteria etc.) and the 
like. The cells described herein may also be derived from other cells including, but not limited 

0 to, primary cultures, existing immortalized cells line and/or other isolated cells. 

An "antigen" refers to a molecule containing one or more epitopes (either linear, 
conformational or both) that will stimulate a host's immune system to make a humoral and/or 
cellular antigen-specific response. The torn is used interchangeably with the term 
"immunogen." Normally, an epitope will include between about 3-15, generally about 5-15 

5 amino acids. A B-cell epitope is normally about 5 amino acids but can be as small as 3-4 amino 
acids. A T-cell epitope, such as a CTL epitope, will include at least about 7-9 amino acids, and a 
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helper T-cell epitope at least about 12-20 amino acids. Nonnally, an epitope will include 
between about 7 and 15 amino acids, such as, 9, 10, 12 or 15 amino acids. The term "antigen" 
denotes both subunit antigens, (ie., antigens which are separate and discrete from a whole 
organism with which the antigen is associated in nature), as well as, killed, attenuated or 
inactivated bacteria, viruses, fungi, parasites or other microbes as well as tumor antigens, 
including extracellular domains of cell surface receptors and intracellular portions that may 
contain T-cell epitopes. Antibodies such as anti-idiotype antibodies, or fragments thereof, and 
synthetic peptide mimotopes, which can mimic an antigen or antigenic determinant, are also 
captured under the definition of antigen as used herein. Similarly, an oligonucleotide or 
polynucleotide that expresses an antigen or antigenic determinant in vivo, such as in gene therapy 
and DNA immunization applications, is also included in the definition of antigen herein. 

An -immunological response" to an antigen or composition is the development in a 
subject of a humoral and/or a cellular immune response to an antigen present in the composition 
of interest For purposes of the present invention, a "humoral immune response" refers to an 
immune response mediated by antibody molecules, including secretory (IgAyor IgG molecules, 
while a "cellular immune response" is one mediated by T-lymphocytes and/or other white blood 
cells. One important aspect of cellular immunity involves an antigen-specific response by 
cytolytic T-cells ("CTL"s). CTLs have specificity for peptide antigens that are presented in 
association with proteins encoded by the major histocompatibility complex (MHC) and 
expressed on the surfaces of cells. CTLs help induce and promote the destruction of intracellular 
microbes, or the lysis of cells infected with such microbes. Another aspect of cellular immunity . 
involves an antigen-specific response by helper T-cells. Helper T-cells act to help stimulate the 
function, and focus the activity of, nonspecific effector cells against cells displaying peptide 
antigens in association with MHC molecules on their surface. A "cellular immune response" 
also refers to the production of cytokines, chemokines and other such molecules produced by 
activated T-cells and/or other white blood cells, including those derived from CD4+ and CD8+ 
T-cells. In addition, a chemokine response may be induced by various white blood or 
endothelial cells in response to an administered antigen. 

II. VACCINE FORMULATIONS 

The invention relates to vaccine formulations for the treatment or prevention of Severe 
Acute Respiratory Syndrome (SARS). Vaccine formulations of the invention include an 
inactivated (or killed) SARS virus, an attenuated SARS virus, a split SARS virus preparation and 
a recombinant or purified subunit formulation of one or more SARS viral antigens. The 
invention includes polypeptides and polynucleotides encoding for SARS viral antigens and 
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immunogenic fragments thereof. Expression and delivery of the polynucleotides of the invention 
may be facilitated via viral vectors and/or viral particles, including Virus Like Particles (VLPs). 
A Inactivated (or Killed) SARS Vaccines 

The invention includes a composition comprising an inactivated (or killed) SARS virus and 

5 methods for the production thereof. Inactivated SARS viral compositions can be used as 

prophylactic or therapeutic SARS virus vaccine. Preferably the inactivated SARS virus vaccine 
composition comprises an amount of inactivated SARS virus which, before inactivation, is 
equivalent to a virus titer of from about 4 to 7 logs plaque forming units (PFU) or 4 to 7 logs 
tissue culture infectious dose 50 (TCID50) per milliliter. More preferably, before inactivation the 

10 virus titer is from 4 to 11, 7 to 11 or 9 to 11 PFU or TCID50. Still more preferably the 

inactivated SARS virus vaccine composition comprises an amount of inactivated SARS virus 
which, before inactivation, is equivalent to a virus titer of from about 5 to 9 PFU or 5 to 9 
TCID50 per milliliter. In one embodiment, thePFUorTCIDsoof the cultured SARS virus at 
harvest is 6 to 8, more preferably about 7.5 PFU or TCID50 per milliliter. Upon concentration of 

15 : the viral harvest, the PFU or TCID50 is preferably 8 to 1 1 , still more preferably about 9 PFU or 
TCED50 per milliliter. The vaccine composition comprises a sufficient amount of the SARS virus 
antigen to produce an immunological response in a primate. 

' Methods of inactivating or killing viruses are known in the art to destroy the ability of the 
viruses to infect mammalian cells. Such methods include both chemical or physical means. 

10 Chemical means for inactivating a SARS viruis include treatment of the virus with an effective 
amount of one or more of the following agents: detergents, formaldehyde, formalin, p- 
propiolactone, or UV light. Additional chemical means for inactivation include treatment with 
methylene blue, psoralen, carboxyfullerene (C60) or a combination of any thereof . Other 
methods of viral inactivation are known in the art, such as for example binary ethylamine, acetyl 

15 ethyleneimine, or gamma irradiation. 

For example formaldehyde may be used at concentrations such as 0.1 to 0.02%, preferably 
at 0.02 to 0.1 %, and still more preferably at 0.04 to 0.05%. The inactivating agent is added to 
virus containing culture supernatants prior to or after harvesting said culture supernatants from 
vessels used for virus propagation, either with or without a step of cell disruption for release of 

0 cell-associated virus prior to harvesting. Further, the inactivating agent may be added after said 
culture supernatants have been stored frozen and thawed, or after one or more steps of 
purification to remove cell contaminants. Preferably, however, formaldehyde is added after 
removal of cells and cellular debris or after one or more purification steps. After addition of 
formaldehyde, the virus containing mixture is transferred into an incubation vessel and incubated 

5 at refrigeration temperatures (e.g. +2 to 8°C) or alternatively at elevated temperatures, such as 
ambient temperatures between approximately 20 and 30°C or at 33°C to 37°C for a period of 12 
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hours to 7 days, whereby the temperature chosen should be adjusted to the duration of 
incubation. Prefered conditions are e.g. +2 -8°C for 3-7 days (prefered are 3 -7days), ambient 
temperatures and incubation for 16 hours to 3 days (prefered 24- 48 hours), or 35-37°C for 12-36 
hours. If it is desirable to remove excess formalin, sodium thiosulf ate or sodium metabisulfite at 

5 equimolar or 1 .5 -fold molar concentration (relative to formaldehyde) may be added after 
completing the inactivation process. 

For example, p-propiolactone may be used at concentrations such as 0.01 to 0.5%, 
preferably at 0.5% to 0.2%, and still more preferably at 0.025 to 0.1%. Hie inactivating agent is 
added to virus containing culture supernatants (virus material) prior to or after harvesting said 

10 culture supernatants from vessels used for virus propagation, either with or without a step of cell 
disruption for release of cell-associated virus prior to harvesting. Further, the inactivating agent 
may be added after said culture supernatants have been stored frozen and thawed, or after one or 
more steps of purification to remove cell contaminants. P-propiolactone is added to the virus 
material, with the adverse shift in pH to acidity being controlled with sodium hydroxide (e.g. y 1 

.5 N NaOH), a Tris-buffer or sodium bicarbonate solution. After transfering the mixture to another 
inactivation vessel, the combined inactivating agent-virus materials are incubated at temperatures 
from 4°C to 37°C, for incubation times of preferably 24 to 72 hours. 

Another inactivant which may be used is binary ethyleneimine (BEI). Equal volumes of a 
0.2 molar bromoethylamine hydrobromide solution and a 0.4 molar sodium hydroxide solution 

0 are mixed and incubated at about 37°C. for 60 minutes. The resulting cyclized inactivant is 

binary ethyleneimine, which is added to the virus materials at 0.5 to 4 percent, and preferably at 
1 to 3 percent, volume to volume. The inactivating virus materials are held from about 4°C to 
37°C for 24 to 72 hours with periodic agitation. At the end of this incubation 20 ml. of a sterile 1 
molar sodium thiosulf ate solution was added to insure neutralization of the BEL 

5 In one embodiment, the invention includes an inactivating method is designed to maximize 

exposure of the virus to the inactivating agent and to minimize long-term exposure of the 
temperature sensitive SARS virus particles to elevated temperatures. The invention includes an 
inactivation method comprising exposing the virus to the inactivation agent (such as BPL) for 12 
to 24 hours at refrigeration temperatures followed by hydrolysis of any residual inactivating 

0 agent by elevating the temperature for only 3 hours. Preferably, the refrigeration temperatures 
are between 0 and 8°C, more preferably around 4°C. Preferably, the elevated temperature is 
between 33 and 41°C, more preferably around 37°C. As assessed by a test for residual infectious 
virus using 10 ml aliquots of the inactivated preparation, the method is able to inactivate SARS- 
CoV in raw cell culture harvests below a theoretical limit of 0.03 infectious units/ml. 

5 Diluted and undiluted samples of the inactivated virus materials are added to susceptible 

cell (tissue) culture (e.g., VERO) to detect any non-inactivated virus. The cultured cells are 
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passaged multiple times and examined for the presence of SARS virus based on any of a variety 
of methods, such as, for example, cytopathic effect (CPE) and antigen detection (e.g. 7 via 
fluorescent antibody conjugates specific for SARS virus). Such tests allow determination of 
complete virus inactivation. 
5 Prior to inactivation, the SARS virus will be cultured in a mammalian cell culture. The cell 

culture may be adherently growing cells or cells growing in suspension. Preferably the cells are 
of mammalian origin, but may also be derived from avian (e.g., hens' cells such as hens* embryo 
cells (CEF cells)), amphibian, reptile, insect, or fish sources. Mammalian sources of cells 
include, but are not limited to, human or non-human primate (e.g„ MRC-5 (ATCC CCL-171), 

10 WI-38 (ATCC CCL-75), HeLa cells, human diploid cells, fetal rhesus lung cells (e.g. ATCC CL~ 
160), human embryonic kidney cells (293 cells, typically transformed by sheared adenovirus 
type 5 DNA), YERO cells (e.g., from monkey kidneys), horse, cow (e.g., MDBK cells), sheep, 
dog (e.g., MDCK cells from dog kidneys, ATCC CCL34 MDCK (NBL2) or MDCK 33016, 
deposit number DSM ACC 2219 as described in WO 97/37001), cat, and rodent (e.g., hamster 

15 cells such as BHK21-F, HKCC cells, or Chinese hamster ovary cells (CHO cells)), and may be 
obtained from a wide variety of developmental stages, including for example, adult, neonatal, 
fetal, and embryo. 

In certain embodiments the cells are immortalized (e.g., PERC.6 cells are described, for 
example, in WO 01738362 and WO 02/40665, incorporated by reference herein in their , 
M) entireties, as well as deposited under ECACC deposit number 96022940), or any other cell type 
immortalized using the techniques described herein. 

In preferred embodiments, mammalian cells are utilized, and may be selected from and/or 
derived from one or more of the following non-limiting cell types: fibroblast cells (e.g. 9 dermal, 
lung), endothelial cells (e.g., aortic, coronary, pulmonary, vascular, dermal microvascular, 
15 umbilical), hepatocytes, keratinocytes, immune cells (e.g. , T cell, B cell, macrophage, NK, 

dendritic), mammary cells (e.g., epithelial), smooth muscle cells (e.g., vascular, aortic, coronary, 
arterial, uterine, bronchial, cervical, retinal pericytes), melanocytes, neural cells (e.g. y astrocytes), 
prostate cells (e.g., epithelial, smooth muscle), renal cells (e.g., epithelial, mesangial, proximal 
tubule), skeletal cells (e.g., chondrocyte, osteoclast, osteoblast), muscle cells (e.g., myoblast, 
0 skeletal, smooth, bronchial), liver cells, retinoblasts, and stromal cells. WO 97/37000 and WO 
97/37001, incorporated by reference herein in their entireties, describe production of animal cells 
and cell lines that capable of growth in suspension and in serum free media and are useful in the 
production and replication of viruses. 

Preferably, the SARS viruses of the invention are grown on VERO cells or fetal rhesus 
5 kidney cells. 
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Culture conditions for the above cell types are well-described in a variety of publications, 
or alternatively culture medium, supplements, and conditions may be purchased commercially, 
such as for example, as described in the catalog and additional literature of Camtarex Bioproducts 
(East Rutherford, NJ). 

5 In certain embodiments, the host cells used in the methods described herein are cultured in 

serum free and/or protein free media. A medium is referred to as a serum-free medium in the 
context of the present invention in which there are no additives from serum of human or animal 
origin. Protein-free is understood to mean cultures in which multiplication of the cells occurs 
with exclusion of proteins, growth factors, other protein additives and non-serum proteins. The 

10 cells growing in such cultures naturally contain proteins themselves. 

Known serum-free media include Iscove's medium, Ultra-CHO medium (BioWhittaker) or 
EX-CRT J, (JRH.Bioscience). Ordinary serum-containing media include Eagle's Basal Medium 
(BME) or Minimum Essential Medium (MEM) (Eagle, Science, 130, 432 (1959)) or Dulbecco's 
Modified Eagle Medium (DMEM or EDM), which are ordinarily used with up to 10% fetal calf 

IS serum or similar additives. Optionally, Minimum Essential Medium (MEM) (Eagle, Science, 
130, 432 (1959)) or Dulbecco's Modified Eagle Medium (DMEM or EDM) may be used without 
any serum containing supplement. Protein-free media like PF-CHO (JHR Bioscience), 
chemically-defined media like ProCHO 4CDM (BioWhittaker) or SMIF 7 (Gibco/BRL life 
Technologies) and mitogenic peptides like Primactone, Pepticase or HyPep™ (all from Quest 

10 International) or lactalbumin hydrolyzate (Gibco and other manufacturers) are also adequately 
known in the prior art. The media additives based on plant hydrolyzates have the special 
advantage that contamination with viruses, mycoplasma or unknown infectious agents can be 
ruled out 

, The cell culture conditions to be used for the desired application (temperature, cell density, 
£5 pH value, etc.) are variable over a very wide range owing to the suitability of the cell line 

employed according to the invention and can be adapted to the requirements of the SARS virus. 

Hie method for propagating the SARS virus in cultured cells (e.g., mammalian cells) 
includes the steps of inoculating the cultured cells with SARS virus, cultivating the infected cells 
for a desired time period for virus propagation, such as for example as determined by SARS 
»0 virus titer or SARS virus antigen expression (e.g., between 24 and 168 hours after inoculation) 
and collecting the propagated virus. The cultured cells are inoculated with a SARS virus 
(measured by PFU or TQD50) to cell ratio of 1:10000 to 1:10. A lower range of ratios may also 
be used e.g. 1:500 to 1:1, preferably 1:100 to 1:5, more preferably 1:50 to 1:10. The SARS virus 
is added to a suspension of the cells or is applied to a monolayer of the cells, and the virus is 
S absorbed on the cells for at least 60 minutes but usually less than 300 minutes, preferably 
between 90 and 240 minutes at 25°C to 40°C, more preferably 28°C to 37°C, still more 

-108- 



WO 2004/092360 PCT/US2004/011710 

preferably at about 33 °C. The infected cell culture (e.g. , monolayers) may be treated either by 
freeze-thawing or by enzymatic action to increase the viral content of the harvested culture 
supernatants. The harvested fluids are then either inactivated or stored frozen. 

A comparison of SARS infected Vera cells grown with and without fetal calf serum 
("FCS") is shown in FIGURE 26A. Briefly, Vero cells were split the day before infection and 
cultivated in T175 flasks. Infection of 90% confluent Vero cell monolayers the following day 
was performed with a SARS-CoV seed stock (strain FRA, passage 4, Accession number 
AY310120), with or without 3% PCS (Fig. 26A). The addition of PCS to the cell media showed 
little impact on virus yield. 

Cultured cells may be infected at a mwtiplicity of infection ("m.o.i.") of about 0.0001 to 
10, preferably 0.002 to 5, more preferably to 0.001 to 2. Still more preferably, the cells are 
infected at a m.oi of about 0.01 . A comparison of viral yield at varying m.oi. levels is shown in 
FIGURE 26B. 

Infected cells may be harvested 30 to 60 hours post infection. Preferably, the cells are . 
harvested 34-48 hours post infection. Still more preferably, the cells are harvested 38 to 40 
hours post infection. See FIGURE 26C. 

Methods of purification of inactivated virus are known in the art and may include one or 
more of, for instance gradient centrifugation, ultracentrifugation, continuous-flow 
ultracentrifugation and chromatography, such as ion exchange chromatography, size exclusion 
chromatography, and liquid affinity chromatography. Additional method of purification include 
ultrafiltration and dialfiltration. See JP Gregersen "HersteUung von Virassimpfstoffen aus 
Zellkulturen" Chapter 4.2 in Pharmazeutische Biotecnologie (eds. O. Kayser and RH Mueller). 
Wissenschafthche Verlagsgesellschaft, Stuttgart, 2000. See also, O'Neil et aL, "Virus 
Harvesting and Affinity Based Liquid Chromatography. A Method for Virus Concentration and • 
Purification", Biotechnology (1993) 1 1:173-177; Prior et aL, "Process Development for 
Manufacture of Inactivated HIV-1", Pharmaceutical Technology (1995) 30-52; and Majhdi et al., 
"Isolation and Characterization of a Coronavirus from Elk Calves with diarrhea" Journal of 
Clinical Microbiology (1995) 35(1 1): 2937-2942. 

Other examples of purification methods suitable for use in the invention include 
polyethylene glycol or ammonium sulface precipitation {see Trepanier et aL, "Concentration of 
human respiratory syncytial virus using ammonium sulfate, polyethylene glycol or hollow fiber 
ultrafiltration" Journal of Virological Methods (1981) 3(4):201-211; Hagen et aL, "Optimization 
of Polyethylene glycol) Precipitation of Hepatitis Virus Used to prepare VAQTA, a Highly 
Purified Inactivated Vaccine" Biotechnology Progress (1996) 12:406412; and Carlsson et aL, 
"Purification of Infectious Pancreatic Necrosis Virus by Anion Exchange Chromatography 
Increases the Specific Infectivity" Journal of Virological Methods (1994) 47:27-36) as well as 
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ultrafiltration and mictofiltration (see Pay et oL. Developments in Biological Standardization 
(1985) 60:171-174; Tsunimi et al. y "Structure and filtration performances of improved 
cuprammonium regenerated cellulose hollow fibre (improved BMM hollow fibre) for virus 
removal" Polymer Journal (1990) 22(12): 1085-1 100; and Makino et aL, "Concentration of live 
5 retrovirus with a regenerated cellulose hollow fibre, BMM", Archives of Virology (1994) 139(1- 
2):87-96.). 

Preferably, the virus is purified using chromatography, such as ion exchange 
chromatography. Chromatic purification allows for the production of large volumes of virus 
containing suspension. The viral product of interest can interact with the chromatic medium by a 

10 simple adsorption/desorption mechanism, and large volumes of sample can be processed in a 
single load. Contaminants which do not have affinity for die adsorbent pass through the column. 
Hie virus material can then be eluted in concentrated form. 

Preferred anion exchange resins for use in the invention include DEAE, EMD TMAE. 
Preferred cation exchange resins may comprise a sulfonic acid-modified surface. In one 

15 embodiment, the virus is purified using ion exchange chromatography comprising a strong anion 
exchange resin (e.g. EMD TMAE) for the first step and EMD-SO3 (cation exchange resin) for 
the second step. A metal-binding affinity chromatography step can optionally be included for 
further purification. (See, e.g., WO 97/06243): 

A preferred resin for use in the invention is Fractogel™ EMD. This synthetic methacrylate 

20 based resin has long, linear polymer chains (so-called "tentacles") covalently attached This 
4t tentacle chemistry" allows for a large amount of sterically accessible ligands for the binding of 
biomolecules without any steric hindrance. This resin also has improved pressure stability. 

Column-based liquid affinity chromatography is another preferred purification method for ' 
use in the invention. One example of a resin for use in this purification method is Matrex™ 

15 Cellufine™ Sulfate (MCS). MCS consists of a rigid spherical (approx. 45-105 fim diameter) 
cellulose matrix of 3,000 Dalton exclusion limit (its pore structure excludes macromolecules), 
with a low concentration of sulfate ester functionality on the 6-position of cellulose. As the 
functional ligand (sulfate ester) is relatively highly dispersed, it presents insufficient cationic 
charge density to allow for most soluble proteins to adsorb onto the bead surface. Therefore the 

SO bulk of the protein found in typical virus pools (cell culture supernatants, e>g. pyrogens and most 
contaminating proteins, as well as nucleic acids and endotoxins) are washed from the column 
and a degree of purification of the bound virus is achieved 

The rigid, high-strength beads of MCS tend to resist compression. The pressure/flow 
characteristics the MCS resin permit high linear flow rates allowing high-speed processing, even 

►5 in large columns, making it an easily scalable unit operation. In addition a chromatographic 

purification step with MCS provides increased assurance of safety and product sterility, avoiding 
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excessive product handling and safety concerns. As endotoxins do not bind to it, the MCS 
purification step allows a rapid and contaminant free depyrogenation. Gentle binding and 
elution conditions provide high capacity and product yield Hie MCS resin therefore represents 
a simple, rapid, effective, and cost-saving means for concentration, purification and 
5 depyrogenation. In addition, MCS resins can be reused repeatedly. 

Hie inactivated virus may be further purified by gradient centrifugation, preferably density 
gradient centrifugation. For commercial scale operation a continuous flow sucrose gradient 
centrifugation would be the preferred option. This method is widely used to purify antiviral 
vaccines and is known to the expert in the field {See JP Giegersen "Herstellung von 

10 Virussimpfstoffen axis Zellkulturen" Chapter 4.2 in Pharmazeutische Biotecnologie (eds. O. 
Kayser and RH Mueller) Wissenschaftliche Verlagsgesellschaft, Stuttgart, 2000.) 

The density gradient centrifugation step may be performed using laboratory or commercial 
scale gradient centrifugation equipment For example, a swinging bucket rotor, a fixed angle 
rotor, or a vertical tube rotor, particularly for laboratory scale production of the virus. 

15 Preferably, the gradient centrifugation step is performed using a swinging bucket rotor. This 
type of rotor has a sufficiently long pathlength to provide high quality separations, particularly 
with multicomponent samples. In addition, swinging bucket rotors have greatly reduced wall 
effects, and the contents do not reorient during acceleration and deceleration. Because of their 
longer pathlength, separations take longer compared to fixed angle or vertical tube rotors. The 

tO prepared sucrosfe solutions are controlled via refiactometer on their sucrose concentration. 
Sucrose gradients for density gradient centrifugation, such as in a swinging bucket 
centrifuge tubes may be formed prior to centrifugation by the use of a gradient former 
(continuous/linear). The volume of sample which can be applied to the gradient in a swinging 
bucket rotor tube is a function of the cross-sectional area of the gradient that is exposed to the 

5 sample. If the sample volume is too high, there is not sufficient radial distance in the centrifuge 
tube for effective separation of components in a multicomponent sample. 

An approximate sample volume for swinging bucket rotor SW 28 is 1-5 ml per tube (with a 
tube diameter of 2.54 cm). Hie sample is applied to the gradient by pipetting the volume on top 
of the gradient. The blunt end of the pipette is placed at 45-60° angle to the tube wall, 

0 approximately 2-3 mm above the gradient The sample is injected slowly and allowed to run 
down the wall of the tube onto the gradient After centrifugation gradient fractions are recovered 
by carefully inserting a gauge needle until the bottom of the tube and starting to collect fractions 
of 2 ml by pumping the liquid from the tube into falcon tubes. 

Sucrose density gradients suitable for use with this density gradient centrifugation 

5 purification step include 0 - 60%, 5 - 60%, 15 - 60%, 0 - 50%, 5 - 50%, 15 - 50%, 0 - 40%, 5 
- 40%, and 15 - 40%. Preferably, the sucrose density gradient is 15 - 40%, 5 - 40% or 0 - 40%. 
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Alternatively, a discontinuous sucrose density gradient may be used for purification. A 
discontinuous sucrose density scheme provides for discrete, overlaying layers of differing 
sucrose concentrations. In one example, a first layer of 50% sucrose is covered by a second 
layer of 40% sucrose; the second layer is covered by a third layer of 20% sucrose; the third layer 
5 is covered by a fourth layer of 10% sucrose; and the fourth layer is covered by the solution 
containing the virus to be purified 

In one-embodiment, inactivated virus is purified by a method comprising a first step of 
chromatography purification and a second step of gradient centrifugation. Preferably the first 
step comprises liquid affinity chromatography, such as MCS. Preferably, the second step 
10 comprises density gradient centrifugation using a swinging bucket rotor. 

Additional purification methods which may be used to purify inactivated SARS virus 
include the use of a nucleic acid degrading agent, preferably a nucleic acid degrading enzyme, 
such as a nuclease having DNase and RNase activity, or an endonuclease, such as from Serratia 
marcescens, commercially available as Benzonase™, membrane adsorbers with anionic 
15 functional groups (e.g. Sartobind™) or additional chromatographic steps with anionic functional 
groups (e.g. DEAE or TMAE). An ultrafiltration/dialfiltration and final sterile filtration step 
could also be added to the purification method 

Preferably, the purification includes treatment of the SARS viral isolate with one or more 
nucleic acid degrading enzymes. These enzymes may be used to reduce the level of host cell 
i0 nucleic acid in the viral purification process. Nucleic acid digesting enzymes for use in cell 
culture are known in the art and include, for example, Benzonase™. 

The treatment of the virus with the nucleic acid degrading enzyme and inactivating agent 
can be performed by a sequential treatment or in a combined or simultaneous manner. 
Preferably, the nucleic acid degrading agent is added to the virus preparation prior to the addition 
15 of the inactivating agent. 

The purified viral preparation of the invention is substantially free of contaminating 
proteins derived from the cells or cell culture and preferably comprises less than about 1000, 
500, 250, 150, 100, or 50 pg cellular nucleic acid / fig virus antigen, preferably less than about 
1000, 500, 250, 150, 100, or 50 pg cellular nucleic acid/ dose. Still more preferably, the purified 
10 viral preparation comprises less than about 20 pg, and even more preferably, less than about 10 
pg. Methods of measuring host cell nucleic acid levels in a viral sample are known in the art 
Standardized methods approved or recommended by regulatory authorities such as the WHO or 
the FDA are preferred 

The invention includes an inactivated vaccine composition comprising a prophylactically 
»5 effective amount of SARS viral antigen, preferably spike or an immunogenic fragment thereof. 
The SARS viral antigen is preferably present in a concentration amount of 0.1 to 50 fig 
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antigen/dose, moie preferably 03 to 30 fig antigen/dose. Still more preferably, the antigen is 
about 15 /ig/dose. 

In one embodiment, a lower concentration of SARS viral antigen is used in inactivated 

vaccine compositions of the invention. Such lower concentration vaccines may optionally 
5 comprise an adjuvant to boost the host immune response to the antigen. In such a 'low dose" 

vaccine, the SARS viral antigen is preferably present in a concentration of less than 15 fig 

antigen/dose, (i.e., less than 10, 7.5, 5 or 3 fig antigen/dose. 

The inactivated vaccine preparations of the invention may further comprise a stabilizer to 

preserve the integrity of the immunogenic proteins in the inactivated viral preparation. 
10 Stabilizers suitable for use in vaccines are known in the art and may include, for example, 

buffers, sugars, sugar alcohols, and amino acids. Stabilizing buffers are preferably adjusted to a 

physiological pH range and may include phosphate buffers, Tris buffers, TE (Tris/EDTA), TEN 

(Tris/NaCl/EDTA) andEarle's salt solution. Stabilizing sugars may include, for example, one or 

more of saccharose, glucose, fructose, dextranes, dextranesulphate, and trehalose. Stabilizing 
15 sugar alcohols may include, for example, Xylite/Xylitole, Mannite/Mannitol, Sorbite/Sorbitol, 

and Glycerol. Amino acids suitable for use in the invention include, for example, L^glutamine, 

arginine, cysteine, and lysine. Additional stabilizers which may be used in the invention include 

Tartaric acid, Pluronic F 68, and Tween 80. 

SARS viral isolates which may be used for the inactivated viral preparations of the 
10 invention may be obtained and identified by any of the mechanisms described supra. For 

example, a SARS isolate may be obtained from a clinical sample and plaque purified. Such 

methods of viral isolation are known in the art 

Further purification procedures can be applied to ensure the seed virus used for preparation 

of the vaccine does not contain, for example, unwanted adventitious agents. In one embodiment, 
5 viral RNA from the viral isolate can be isolated from the virus, purified (and, optionally, the 

sequence verified through PCR or other means) and then introduced into a suitable cell culture. 
As an example of this technique, a clinical viral sample is plaque purified and amplified on 

vero cells to generate a sufficient amount of the viral sample for analysis. Cellular remnants are 

then cleared from the supernatant by centrifugation. The virus can then be pelleted by 
0 ultracentrifugation and the pellet resuspended in PBS. After further centrifugation purification, 

the virus containing fraction is treated with a DNase (and optionally also an RNase). Viral RNA 

is then isolated from this fraction and transfected into a host cell. 

Examples 2 and 3 provide an illustration of purification of inactivated whole SARS virus 

using MCS chromatography resin purification followed by density gradient ultracentrifugation. 
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Routes and methods of immu nization of the vaccines of (he invention are discussed in 
more detail in a section below. Examples 4 and 5 provide illustrations of a mouse immunization 
scheme with the inactivated SARS virus of the invention. 

B. Attenuated SARS Vaccines 

5 The invention includes a composition comprising an attenuated SARS virus. This 
composition can be used as a prophylactic or therapeutic SARS virus vaccine. Methods of 
attenuating viruses are known in the art Such methods include serial passage of the SARS virus 
in cultured cells (e.g. 9 mammalian cell culture, preferably fetal rhesus kidney cells or VERO 
cells-see the discussion in Section A above regarding culture of SARS virus), until the SARS 

10 virus demonstrates attenuated function. The temperature at which the virus is grown can be any 
temperature at which with tissue culture passage attenuation occurs. Attenuated function of the 
SARS virus after one or more passages in cell culture can be measured by one skilled in the ait. 
As used herein, attenuation refers to the decreased virulence of the SARS virus in a human 
subject Evidence of attenuated function may be indicated by decreased levels of viral 

15 replication or by decreased virulence in an animal model. 

Other methods of producing an attenuated SARS virus include passage of the virus in cell 
culture at sub-optimal or "cold" temperatures and introduction of attenuating mutations into the 
SARS viral genome by random mutagenesis (e.g y chemical mutagenesis) or site specific directed 
mutagenesis. Preparation and generation of attenuated RS V vaccines (the methods of which will 

20 generally applicable to SARS virus) are disclosed in, for example, EP 0 640 128, US Patent No. 
6,284,254, US Patent No. 5,922,326, US Patent No. 5,882,651. 

The attenuated derivatives of SARS virus are produced in several ways, such as for 
example, by introduction of temperature sensitive-mutations either with or without chemical 
mutagenesis (e.g., 5-fluorouracil), by passage in culture at "cold" temperatures. Such cold 

15 adaptation includes passage at temperatures between about 20°C to about 32°C, and preferably 
between temperatures of about 22°C to about 30°C, and most preferably between temperatures 
of about 24°C and 28°C. The cold adaptation or attenuation may be performed by passage at 
increasingly reduced temperatures to introduce additional growth restriction mutations. Hie 
number of passages required to obtain safe, immunizing attenuated virus is dependent at least in 

10 part on the conditions employed. Periodic testing of the SARS virus culture for virulence and 
immunizing ability in animals mouse, primate) can readily determine the parameters for a 
particular combination of tissue culture and temperature. The attenuated vaccine will typically be 
formulated in a dose of from about 10 3 to 10 6 PFU or TCID50, or more for maximal efficacy. 
Attenuated virus vaccines for SARS-CoV also are produced by creating virus chimeras 

6 comprising sequences derived from at least two different coronaviruses, one of which is a SARS- 

CoV. For example, a virus chimera is produced that comprises nonstructural protein encoding 
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genes derived from a fiist arenavirus (e.g., murine, bovine, porcine, canine, feline, avian 
coronavirus) and one or more structural protein encoding genes (e.g., spike, E, M) from a SARS- 
CoV. Alternatively, the virus chimera may comprise sequences derived from a human 
coronavirus that is not a SARS-CoV (e.g., OC43, 229E) together with sequences from a SARS- 
5 CoV. Chimeric coronaviruses of the present invention are generated by a variety of methods, 
including for example allowing for natural RNA recombination in a eukaryotic (e.g. 9 
mammali an) cell that contains RNA from each of the parental coronaviruses (e.g., following 
infection) or by using standard molecular biology techniques known to those of skill in the art to 
engineer desired virus chimeras (or portions thereof) as cDNA clones, which may then be used to 
10 produce infectious virus (see for example, US 6593111 B2; Yount et aL, 2003, Proa Natl Acad. 
ScL USA 100(22): 12995-13000). An attenuated phenotype of the coronavirus chimeras 
described herein can be readily measured by one of skill in the art. 

Attenuated viruses can be also generated by deleting one or more open reading frames 
(ORFs) that are not essential for viral replication. Preferably, these deletions occur in the. 
.5 structural region of the genome, such as ORF 3a, 3b, 6, 7a, 7b, 8a, 8b, 9b. See e.g., Haijema BJ, - 
Volders H, Rottier PJ. J Virol. (2004) 78(8):3863-71; and de Haan, C. A., P. S. Masters, X. Shen, 
S. Weiss, and P. J. Rottier, 'The group-specific murine coronavirus genes ate not essential, but 
their deletion, by reverse genetics, is attenuating in the natural host" Virology (2002) 296:177- 
189. Deletion of such regions within a coronavirus such as SARS can be achieved, for example* 
:0 by reverse genetics or "targeted recombination" (See, e.g., Masters, P. S., "Reverse genetics of 
the largest UNA viruses > ', Adv. Virus Res. (1999)53:245-264. ' 

Methods of purification of attenuated virus are known in the art and may include one or 
more of, for instance gradient centrifugation and chromatography. See Gregersen "Herstellung 
von Virussimpfstoffen aus Zellkulturen" Chapter 4.2 in Pharmazeutische Biotecnologie (eds. O. 
5 Kayser and RH Mueller) Wissenschaftliche Verlagsgesellschaft, Stuttgart, 2000. 
C. Svlii SARS Vaccines 

The invention includes a composition comprising a split SARS virus formulation and 
methods for the manufacture thereof. This composition can be used as a prophylactic or 
therapeutic SARS virus vaccine. 
0 Methods of splitting enveloped viruses are known in the art. Methods of splitting 

enveloped viruses are disclosed, for example, in WO 02/28422, incorporated herein by reference 
in its entirety, and specifically including the splitting agents and methods described therein. 
Methods of splitting influenza viruses are disclosed, for example, in WO 02/067983, WO 
02/074336, and WO 01/21151, each of which is incorporated herein by reference in its entirety. 
5 The splitting of the virus is carried out by disrupting or fragmenting whole virus, infectious 

(wild-type or attenuated) or non-infectious (for example inactivated), with a disrupting 
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concentration of a splitting agent Hie disraption results in a full or partial solubilisation of the 
virus proteins, altering the integrity of the virus. 

Preferably, the splitting agent is a non-ionic or an ionic surfactant Accordingly, the split 
SARS virus formulations of the invention may also comprise at least one non-ionic surfactant or 
detergent Examples of splitting agents useful in the invention include: bile acids and derivatives 
thereof, non-ionic surfactants, alkylglycosides or alkylthioglycosides and derivatives thereof, 
acyl sugars, sulphobetaines, betains, polyoxyethylenealkylethers, Njsf-dialkyl-Glucamides, 
Hecameg, alkylphenoxypolyethoxyethanols, quaternary ammonium compounds, sarcosyl, CTAB 
(cetyl trimethyl ammonium bromide) or Cetavlon. 

Preferably, the ionic surfactant is a cationic detergent Cationic detergents suitable for use 
in the invention include detergents comprising a compound of the following formula: 



Ri, R2 and R 3 are the same or different and each signifies alkyl or aryl, or 
Ri and R2, together with the nitrogen atom to which these are attached form a 5- or 6- 
membered heterocyclic ring, and 



Ri, R 2 and R 3 together with the nitrogen atom to which these are attached, signify a 5- or 6- 
membered heterocyclic ring, unsaturated at the nitrogen atom, 
R4 signfies alkyl or aryl, and 
X signifies an anion. 

Examples of such cationic detergents are cetyltrimethylammonium salts, such as 
ceytltrimethylammonium bromide (CTAB) and myristyltrimethylammonium salt 

Additional cationic detergents suitable for use in the invention include lipofectine, 
lipofectamine, and DOT-MA. 

Non-ionic surfactants suitable for use in the invention include one or more selected from 
the group consisting of the octyl- or nonylphenoxy polyoxyethanols (for example the 
commercially available Triton series), polyoxyethylene sorbitan esters (Tween series) and 
polyoxyethylene ethers or esters of the general formula : 



wherein n is 1-50, A is a bond or -C(OK R is C1.50 alkyl or phenyl Ci_ 50 alkyl; and 
combinations of two or more of these. 

The invention comprises a method of preparing a split SARS virus comprising contacting 
the SARS virus with a sufficient amount of splitting agent to disrupt the viral envelope. The loss 




wherein 



R3 signifies alkyl or aryl, or 



OCC^CHzOVA-R 
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of integrity after splitting renders the virus non-infectious. Once the disrupted viral envelope 
proteins are generally no longer associated with whole intact virions, other viral proteins are 
preferably fully or partially solubilized and are therefore not associated, or only in part 
associated, with whole intact virions after splitting. 
5 Hie method of preparing a split SARS virus may further comprise removal of the splitting 

agents and some or most of the viral lipid material. The process may also include a number of 
different filtration and/or other separation steps such as ultracentrifugation, ultrafiltration, zonal 
centrifugation and chromatographic steps in a variety of combinations. The process may also 
optionally include an inactivation step (as described above) which may be carried out before or 
10 after the splitting. Hie splitting process may be carried out as a batch, continuous, or semi- 
continuous process. 

Split SARS virus vaccines of the invention may include structual proteins, membrane 
fragments and membrane envelope proteins. Preferably, the split SARS virus preparations of the 
invention comprise at least half of the viral structural proteins. 
15 One example of a method of preparing a split SARS virus formulation includes the 

following steps: 

(i) propagation of the SARS virus in cell culture, such as MRC-5 cells (ATCC CCL-171), 
WI-38 cells (ATCC CCL-75), fetal rhesus kidney cells or vera cells (See the discussion in 
Section A, above, regarding culture of SARS virus); 
20 (ii) harvesting of SARS virus-containing material from the cell culture; 

(iii) clarification of the harvested material to remove non-SARS virus material; 

(iv) concentration of the harvested SARS virus; 

(v) separation of the whole SARS virus from non-virus material; 

(vi) splitting of the whole SARS virus using a suitable splitting agent in a density gradient 
25 centrifugation step; and 

(vii) filtration to remove undesired materials. 

The above steps are preferably performed sequentially. 

The clarification step is preferably performed by centrifugation at a moderate speed. 
Alternatively, a filtration step may be used for example with a 0.2/im membrane. 
50 The concentration step may preferably employ an adsorption method, for instance, using 

CaHPQ*. Alternatively, filtration may be used, for example ultrafiltration. 

A further separation step may also be used in the method of the invention. This further 
separation step is preferably a zonal centrifugation separation, and may optionally use a sucrose 
gradient The sucrose gradient may further comprise a preservative to prevent microbial growth. 
$5 The splitting step may also be performed in a sucrose gradient, wherein the sucrose 

gradient contains the splitting agent 

-117- 



WO 2004/092360 



PCT/DS2004/011710 



Hie method may further comprise a sterile filtration step, optionally at the end of the 
process. Preferably, there is an inactivation step prior to the final filtration step. 

Methods of preparing split SARS vims formulations may further include treatment of the 
viral formulation with a DNA digesting enzyme. These enzymes may be used to reduce the level 
5 of host cell DNA in the viral purification process. DNA digesting enzymes for use in cell culture 
are known in the art and include, for example, Benzonase®. 

Treatment of the SARS virus formulation with a DNA digesting enzyme may occur at any 
time in the purification and splitting process. Preferably, however, the SARS virus formulation 
is treated with a DNA digesting enzyme prior to use of a detergent Still more preferably, the 
0 SARS virus formulation is treated with a DNA digesting enzyme, such as Benzonas, prior to 
treatment with a cationic detergent, such as CTAB. 

Methods of purification of split virus are known in the art See JP Gregersen "Herstellung 
von Virussimpfstoffen aus Zellkulturen" Chapter 4.2 in Pharmazeutische Biotecnologie (eds. O. 
Kayser and RH Mueller) Wissenschaftliche Yerlagsgesellschaft, Stuttgart, 2000. 
.5 The invention includes a split vaccine composition comprising a prophylactically effective 

amount of SARS viral antigen, preferably spike or an immunogenic fragment thereof. The 
SARS viral antigen is preferably present in a concentration amount of 0.1 to 50 jig antigen/dose, 
more preferably 0.3 to 30 fig antigen/dose. Still more preferably, the antigen is about 15 
/ig/dose. 

0 In one embodiment, a lower concentration of SARS viral antigen is used in split vaccine 

compositions of the invention. Such lower concentration vaccines may optionally comprise an 
adjuvant to boost the host immune response to the antigen. In such a "low dose" vaccine; the 
SARS viral antigen is preferably present in a concentration of less than 15 fig antigen/dose, (ue., 
less than 10, 7.5, 5 or 3 fig antigen/dose. 

5 D. Subunit SARS Vaccines 

The invention includes a composition comprising an isolated or purified SARS viral 
antigen or a derivative thereof. The composition may further comprise one or more adjuvants. 

SARS viral antigens can be isolated or purified from a SARS virus grown in cell culture. 
Alternatively, SARS viral antigens can be recombinantly produced by methods known in the art. 

D The SARS viral antigens used in the invention can be produced in a variety of different 

expression systems which are known in the art; for example those used with mammalian cells, 
baculoviruses, bacteria, and yeast Such expression systems will typically use polynucleotides 
encoding the viral antigens of the invention. Such sequences can be obtained using standard 
techniques of molecular biology, including translating the amino acid sequences listed herein. 

5 Accordingly, the invention includes polynucleotides encoding for the viral antigens of the 
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invention. In addition, the viral antigens of the invention can be produced (at least in part, 
preferably in whole) via synthetic chemistry methods. 

Insect cell expression systems, such as baculovirus systems, are known to those of skill in 
the art and described in, e.g. , Summers and Smith, Texas Agricultural Experiment Station 
Bulletin No. 1555 (1987). Materials and methods for baculovirus/insert cell expression systems 
are commercially available in kit form from, inter alia, Invitrogen, San Diego CA. Similarly, 
bacterial and mammali an cell expression systems are also known in the art and described in, e.g. , 
Yeast Genetic Engineering (Barr et al, eds., 1989) Butterworths, London. 

A number of appropriate host cells for use with the above systems are also known. For 
example, mammalian cell lines are known in the art and include immortalized cell lines available 
from the American Type Culture Collection (ATCC), such as, but not limited to, Chinese 
hamster ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) cells, monkey kidney cells 
(e.g., Hep G2), Madin-Darby bovine kidney ("MDBK") cells, as well as others. Mammalian 
sources of cells include, but are not limited to, human or non-human primate (e.g., MRC-5 
(ATCC CCL-171), WI-38 (ATCC CCL-75), fetal rhesus lung cells (ATCC OW60), human 
embryonic kidney cells (293 cells, typically transformed by sheared adenovirus type 5 DNA), 
VERO cells from monkey kidneys), horse, cow (e.g., MDBK cells), sheep, dog (e.g., MDCK 
cells from dog kidneys, ATCC CCL34 MDCK (NBL2) or MDCK 33016, deposit number DSM 
ACC 2219 as described in WO 97/37001), cat, and rodent (e.g., hamster cells such as BHK21-F, 
HKCC cells, or Chinese hamster ovary cells (CHO cells)), and may be obtained from a wide 
variety of developmental stages, including for example, adult, neonatal, fetal, and embryo. 

Similarly, bacterial hosts such as E. coli, Bacillus subtilis, and Streptococcus spp., will find- 
use with the present expression constructs. Yeast hosts useful in the present invention include, 
inter alia, Saccharomyces cerevisiae, Candida albicans, Candida maltosa, Hansenual 
polymorpha, Kluyveromycesfragilis, EJuyveromyces lactis, Pichia guillerimondii, Pichia 
pastoris, Schizosaccharomyces pombe and Yarrowia lipolytica. Insect cells for use with 
baculovirus expression vectors include, inter alia, Aedes aegypti, Autographa californica, 
Bombyx mori, Drosophila melanogaster, Spodoptera frugiperda, and Trichoplusia ni. 

Nucleic acid molecules comprising nucleotide sequences of the viral antigens or antibodies 
of the invention can be stably integrated into a host cell genome or maintained on a stable 
episomal element in a suitable host cell using various gene delivery techniques well known in the 
art. See., e.g., US Patent No. 5,399346. 

Depending on the expression system and host selected, the molecules are produced by 
growing host cells transformed by an expression vector under conditions whereby the protein is 
expressed The expressed protein is then isolated from the host cells and purified. If the 
expression system secretes the protein into growth media, the product can be purified directly 
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from the media. If it is not secreted, it can be isolated from cell lysates. The selection of the 
appropriate growth conditions and recovery methods are within the skill of the art 

Hie invention includes a composition comprising an isolated or purified SARS viral 
antigpn or a derivative thereof. The invention also includes a composition comprising at least 

5 two isolated or purified SARS viral antigens or derivatives thereof, which have been co-purified 
or purified separately and then combined In one embodiment, the SARS viral antigen is a spike 
(S) protein. In yet another embodiment, the SARS viral antigen is a nucleocapsid (N) protein, a 
membrane (M) glycoprotein, or an envelope (E) protein. Preferably, the SARS viral antigen is 
present in the composition in a purity greater than 75% (e.g., 78%, 80%, 82%, 85%, 88%, 90%, 

10 92%, 95%, 98%). 

The invention includes a vaccine composition comprising a prophylactically effective 
amount of SARS viral antigen, preferably spike or an immunogenic fragment thereof. Hie 
SARS viral antigen is preferably present in a concentration amount of 0. 1 to 50 fig antigen/dose, 
more preferably 0.3 to 30 fig antigen/dose. Still more preferably, the antigen is about 15 

15 fig/dose. 

In one embodiment, a lower concentration of SARS viral antigen is used in vaccine 
compositions of the invention. Such lower concentration vaccines may optionally comprise an 
adjuvant to boost the host immune response to the antigen. In such a "low dose" vaccine, the 
SARS viral antigen is preferably present in a concentration of less than 15 fig antigen/dose, 
JO less than 10, 7.5, 5 or 3 fig antigen/dose. 

The following example illustrates a method of preparing a SARS virus spike (S) protein 
subunit vaccine. 

SARS virus S antigen may be isolated and purified from a variety of sources and using a 
variety of methods, including, but not limited to, S antigen expressed in cultured eukaryotic cells 

55 (e.g., mammalian cells, such as VERO, CHO) or bacteria (e.g., K coli). Expression of may be 
achieved by a variety of means, such as, for example, from SARS virus infected cell culture or 
cell culture supernatants, from cultured cells stably transformed with a DNA expression cassette 
encoding the SARS virus S protein (e.g., RNA polymerase II promoter operably linked to a 
SARS virus S gene), or from cultured cells infected with a replication-competent or replication- 

•0 incompetent virus-based expression vector (e.g., adenovirus vector, poxvirus vector, alphavirus 
vector, retrovirus vector) encoding the SARS vims S protein, as a means to eliminate the need to 
work with infectious SARS virus. 

1. Subunit SARS Vaccines Produced from SARS Virus Cultures 

The SARS virus may be grown in cultured mammalian celle, such as VERO cells, then 

5 separated from the cultured cells. A SARS viral antigen, such as the S protein, can then be 

solubilized and separated from the SARS virus, and further isolated and purified. 
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In one example, the SARS virus may be produced as described in the Inactivated SARS 
vaccine examples, then the desired SARS antigen, such as spike protein, may be further purified 
from the end product using techniques known in the art 

In another example, a SARS subunit vaccine may be produced as follows. SARS virus 
5 may be produced using a desired mammalian cell line on microcarrier beads in large, controlled 
fermentors. For example, vaccine quality African Green Monkey kidney cells (VERO cells) at a 
concentration of 10 s cells/mL are added to 60 to 75 L of CMRL 1969 media, pH 7.2, in a 150 L 
bioreactor containing 360 g of Cytodex-1 microcarrier beads and stirred for 2 hours. Additional 
CMRL 1969 is added to give a total volume of 150 L. Fetal bovine serum (FBS) is added to a 

10 final concentration of 3.5%. Glucose is added to a final concentration of 3.0 g/L and glutamine 
is added to a final concentration of 0.6 g/L. Dissolved oxygen, pH, agitation and temperature are 
controlled, and cell growth, glucose, lactate and glutamine levels are monitored. When cells are 
. in logarithmic phases usually on days 3 to 4 reached a density of about 1.0-2.5xl0 6 cells/mL, the 
culture medium is drained from the fermentor and 120 L of CMRL 1969, pH 7.2 (no FBS) is 

15 added and the culture stirred for 10 minutes. The draining and filling of the fermentor is usually 
repeated once but could be repeated up to three times. After washing the cells* the fermentor is . 
drained and 50 L of CMRL 1969 containing 0.1% (v/v) FBS is added. The SARS virus 
inoculum is added at a multiplicity of infection (m.o.i.) of 0.001 to 0.01. Trypsin may be added 
to promote efficient infection. Additional CMRL 1969 with 0.1% FBS is added to give a final 

20 volume of 150 L. Incubation is continued at 34 C. One viral harvest is obtained from a single 
fermentor lot, typically at 2-7 days post-infection. Multiple harvests from a single fermentation 
may also be obtained. 

The isolation and purification of S protein may be effected by a variety of means, as 
described below. For example, collecting S protein-containing flow-through from ion exchange 

25 chromatography of solubilized SARS virus envelope proteins; loading the flow through onto a 
hydroxyapatite matrix, and selectively eluting the S protein from the hydroxyapatite matrix. Hie 
selectively eluted S protein may be further concentrated by tangential flow ultrafiltration. 

Alternatively, the isolation and purification may be effected by collecting S protein- 
containing flow-through from ion exchange chromatography of the solubilized SARS virus 

JO envelope proteins; loading the flow through onto a hydroxyapatite matrix and collecting an S 
protein-containing flow through, selectively removing detergent used in the solubilization step 
from the hydroxyapatite matrix flow through to provide isolated and purified S protein. The 
isolated and purified S protein may be subsequently concentrated by tangential flow 
ultrafiltration 
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Nucleic acid contaminants may be removed from the isolated and purified S protein by 
treatment with a nucleic acid degrading agent as described above in the Inactivation section. 
Preferably, the nucleic acid degrading agent is a nuclease, such as for example, Benzonase. 
The isolated and purified S protein may be applied to a gel filtration medium and the S 
5 protein subsequently collected therefrom to separate the S protein from contaminants of other 
molecular weights. 

Alternatively, the isolation and purification may be effected by loading S protein on a first 
ion-exchange medium while permitting contaminants to pass through the medium, eluting the S 
protein from the first ion-exchange medium, to separate the S protein from contaminants of other 
10 molecular weights. The eluted S protein is applied to a second ion-exchange medium while 
allowing contaminants to pass through the second ion-exchange medium. The S protein is 
subsequently eluted therefrom, to provide the isolated and purified S protein. The eluted S 
protein may be. concentrated by tangential flow ultrafiltration. 

Alternatively, substantially pure SARS virus S protein suitable for use as an immunogen in 
5 a subunit vaccine formulation may be prepared from infected cell lysates, such as for example 
using a non-denaturing detergent buffer containing 1% Triton X-100 and deoxycholate to lyse 
infected cells. The cell lysates are clarified by centrifugation and S protein is purified from the 
cell lysates by immunoaffinity purification. A monoclonal antibody against the S protein is 
generated and coupled to beads and a column is constructed with those beads. SARS-infected 
0 cell lysates are applied to the column, and the column is washed with PBS containing 0. 1% 
Triton X-100. Protein bound to the column is eluted with 0.1M glycine, pH 2.5, 0.1% Triton X- 
100. Elution samples are buffered, such as for example, with Tris, and analyzed for the presence 
of protein. Fractions containing the protein are pooled and dialyzed against PBS 

As discussed above, the present invention includes isolated and purified S protein of SARS 
5 virus. Jh one example, the virus is grown on a vaccine quality cell line, such as VBRO cells, and 
the grown virus is harvested. The virus harvest is filtered and then concentrated typically using 
tangential flow ultrafiltration using a membrane of desired molecular weight cut-off and 
diafiltered. The virus harvest concentrate may be centrifuged and the supernatant discarded. The 
pellet from the centrifugation men is detergent extracted to solubilize the S protein, for example, 
) by resuspending the pellet to the original harvest concentrate volume in an extraction buffer 
containing a detergent such as a non-ionic detergent including TRITON X-100. 

Following centrifugation to remove non-soluble proteins, the S protein extract is purified 
by chromatographic procedures. The extract may first be applied to an ion exchange 
chromatography column such as a TMAE-fractogel or S-fractogel column equilibrated to permit 
the S protein to flow through while impurities are retained on the column. 
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Next, the flow through may be loaded onto a hydroxyapatite column, equilibrated to permit 
binding of the S protein to the matrix and to permit contaminants to pass from the column. The 
bound S protein is then eluted from the column by a suitable elutant The resulting purified 
solution of S protein may be further processed to increase its purity. The eluate first may be 
concentrated by tangential flow ultrafiltration using a membrane of desired molecular weight 
cut-off. The filtrate may be contacted with a polyethylene glycol of desired molecular weight, 
for example, about 6000 to 8000, to precipitate the protein. Following centritugation and discard 
of the supernatant, the pellet may be resuspended in PBS and dialyzed to remove the 
polyethylene glycol. Finally, the dialyzed solution of S protein may be sterile filtered. The 
sterile filtered solution may be adsorbed onto alum. The polyethylene glycol precipitation and 
resuspension purification step may be effected at an earlier stage of the purification operation, if 
desired: 

Alternatively, SARS virus is recovered following growth and harvesting of the virus, and a 
concentrate obtained such as, for example using PEG precipitation or tangential flow filtration. 
The virus is contacted with detergent to solubilize the S proteins. Following centrifugation, the 
supernatant is recovered to further purification of the S protein and the non-soluble protein's 
discarded. 

The supernatant is applied to an ion exchange chromatography column, such as a TMAE- 
fractogel or S-fractogel column, suitably equilibrated to permit retention of the S protein on the ' 
column. The S protein is eluted from the ion-exchange column under suitable conditions. The 
eluate then may be passed through a gel filtration column, such as a Sephacryl S-300 column, to 
separate the S protein from contaminants of other molecular weights. A hydroxyapatite column 
may be employed in place of the Sephacryl column. 

The S protein may be eluted from the column to provide a purified solution of S protein. 
The eluate may be concentrated by tangential flow ultrafiltration using a membrane of desired 
molecular weight cut-off. The concentrated S protein solution then may be sterile filtered. 

Alternatively, viral harvests may be concentrated by ultrafiltration and the concentrated 
viral harvests may be subjected to an initial purification step, for example, by gel filtration 
chromatography, polyethylene glycol precipitation or Cellufine sulfate chromatography. The 
purified virus may then be detergent extracted to solubilize the S protein. Following 
solubilization of the S protein, the supernatant may be loaded onto an ion-exchange column such 
as Cellufine sulfate chromatography column equilibrated to permit the protein to bind to the 
column while permitting contaminants to flow through. Similarly, a TMAE-fractogel or S- 
fractogel column may be used in place of the Cellufine sulfate column. The two columns also 
may be combined in sequential purification steps. The S protein is eluted from the columns to 
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provide a purified solution of the protein. This solution may be concentrated by tangential flow 
ultrafiltration using a membrane of desired molecular weight cut-off and diafiltered 

Specifically, in one method of S protein purification, the virus harvest concentrate is 
centrifuged at 28,000 x g for 30 minutes at 4 C. Hie supernatant is discarded and the pellet 

5 resuspended in extraction buffer consisting of 10 mM Tris-HCl, pH 7.0, 150 mM NaCl, 2% 
(w/v) Triton X-100 to the original harvest concentrate volume. Pef abloc is added to a final 
concentration of 5 mM The suspension is stirred at room temperature for 30 minutes. The 
supernatant, containing the soluble S protein, is clarified by centrifugation at 28,000 x g for 30 
minutes at 4 C. A TMAE-Fractogel column is equilibrated with 10 mM Tris-HCl, pH 7.0, 150 

10 mM Nad containing 0.02% Triton X-100. Hie Triton X-100 supernatant, containing the soluble 
S protein, is loaded directly onto the TRAR-Fractogel column. The total volume added plus 2 
bed volumes of 10 mM Tris-HCl, pH 7.0, 150 mM NaCl containing 0.02% Triton X-100 are 
collected The TMAE-Fractogel flow-through containing S protein is diluted 3-fold with 10 
mM Tris-HCl, pH 7.0, containing 0.02% Triton X-100. 

15 An hydroxyapatite column is equilibrated with 10 mM Tris-HCl, pH 7.0, 50 mM NaCl, 

0.02% Triton X-100. After loading the TMAE flow-through, the column is washed with 2 
column volumes of 10 mM Tris-HCl, pH 7.0, 50 mM NaCl , 0.02% Triton X-100 followed by 4 
column volumes of 5 mM sodium phosphate, pH 7.0, 1M NaCl, 0.02% Triton X-100. The 
proteins are eluted with 4 column volumes of 20 mM sodium phosphate, pH 7.0, 1M NaCl , 

20 0.02% Triton X-100. Fractions are collected based on A280 and the protein content and antigen 
concentrations are measured The purified S protein is ultrafiltered by tangential flow 
ultrafiltration using a 300 kDa NMWL membrane. 
2. Recombinant Production of Snhmrit SARS Vaccines 

As discussed above, SARS virus proteins may be produced by recombinant expression. 

25 Host cells suitable for recombinant expression include bacterial, mammalian, insect, yeast, etc. 
Recombinant expression may be used to produce a full length SARS protein, a fragment thereof, 
or a fusion therewith. 

Fusion peptides may be used to facilitate the expression and purification of the 
recombinant SARS protein. For example, recombinant production of the SARS polypeptides 

30 can be facilitated by the addition a tag protein to the SARS antigen to be expressed as a fusion 
protein comprising the tag protein and the SARS antigen. Such tag proteins can facilitate 
purification, detection and stability of the expressed protein. Tag proteins suitable for use in the 
invention include a polyargmine tag (Arg-tag), polyhistidine tag (EEs-tag), FLAG-tag, Strep-tag, 
c-myc-tag, S-tag, calmodulin-binding peptide, cellulose-binding domain, SBP-tag„ chitin- 

35 binding domain, glutathione S-transferase-tag (GST), maltose-binding protein, transcription 
termination anti-tenniniantion factor (NusA), E. coli thioredoxin (TrxA) and protein disulfide 
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isomerase I (DsbA). Preferred tag proteins include His-tag and GST. A full discussion on the 
use of tag proteins can be found at Terpe et aL y "Overview of tag protein fusions: from 
molecular and biochemical fundamentals to commercial systems**, Appl Microbiol Biotechnol 
(2003)60:523-533. 

5 After purification, the tag proteins may optionally be removed from the expressed fusion 

protein, Le. 9 by specifically tailored enzymatic treatments known in the art Commonly used 
proteases include enterokinase, tobacco etch virus (TEV), thrombin, and factor Xa- 

Accordingly, the invention further includes a SARS virus subunit vaccine comprising a 
fusion protein. Preferably, the fusion protein comprises a first amino acid sequence encoded by 
10 a SARS virus polynucleotide sequence. SARS virus polynucleotide sequences which may 
encode said first amino acid sequence include one or more of the SARS virus polynucleotide 
sequences identified in this application and fragments thereof. 

The fusion protein may comprise an amino acid sequence of a SARS virus protein or a 
fragment thereof- Said SARS virus protein may be selected from one or more of the group 
15 consisting of the following SARS virus proteins: P28, P65, Nspl, Nsp2 (3CL protease), Nsp3, 
Nsp3, Nsp4, Nsp 5, Nsp6, Nsp 7, Nsp 8> Nsp 9 (RNA polymerase), Nsp 10 (helicase), Nsp 11, 
Nsp 12, Nsp 13, Spike, Orf 3, Orf 4, Envelope, Matrix, Qrf 7, Qrf 8, Orf 9, Orf 10, Orf 1 1, 
Nucleocapsid and Orf 13. 

In one embodiment, the fusion protein comprises a first amino acid sequence comprising a 
tO SARS virus antigen or a fragment thereof. Said SARS virus amino acid sequence may comprise 
one or more of the T-epitope sequences identified above. 

Preferably, the fusion protein comprises an amino acid sequence of a SARS virus spike 
protein, or a fragment thereof. Specific fragments of the spike protein which may be used in the 
fusion protein include the SI domain and the S2 domain. Further fragments of the spike protein 
5 which may be used in the fusion protein include regions of each of the S 1 and S2 domains, 
including the receptor binding region of the SI domain, the oligomerization domain regions of 
the S2 domain, the leucine zipper regions of the S2 domain, the membrane anchor region of the 
S2 domain, the hydrophobic domain region of the S2 domain, the cystein-rich domain region of 
the S2 domain, and the cytoplasmic tail region of the S2 domain. (See FIGURE 19). Amino 
0 acid sequences of the Spike protein corresponding to these regions can be identified by those 
skilled in the art, including, for example, using the functional predictions set forth earlier in the 
application (predicted transmembrane helices, predicted N-terminus signaling regions, predicted 
coiled-coil regions, etc.) as well as by homology comparison to the sequences of other known 
Coronaviruses (See FIGURES 4F and 5). 
5 The fusion protein may further comprise a second amino acid sequence. Said second 

amino acid sequence may comprise a polypeptide sequence which facilitates protein expression 
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or purification, preferably one of the tag sequences discussed above. Alternatively, said second 
amino acid sequence may comprise a second amino acid sequence from a SARS vims. 
Alternatively, said second amino acid sequence may comprises an amino acid sequence from 
another vims or bacteria, including one or more of the viruses or bacteria identified in Section I, 
5 below. 

Said second amino acid sequence may comprise an amino acid sequence from another . 
respiratory virus. Said second amino acid sequence may comprise an amino acid sequence from 
a virus selected from the group consisting of coronavirus, influenza virus, rhinovixus, 
parainfluenza vims (PIV), respiratory syncytial virus (RSV), adenovirus, and metapneumovirus. 
10 In one embodiment, said second amino acid sequence may comprise an amino acid 

sequence from an adjuvant, including one or more of the adjuvants identified in section I, below. 

In one embodiment, the invention includes a fusion protein comprising an amino acid 
sequence of a SARS virus spike protein or a fragment thereof. The fusion protein may further 
comprise a second amino acid sequence comprising an amino acid sequence selected from the 
15 . group consisting of a second SARS vims protein, a non-SARS vims protein, a bacterial protein, 
and an adjuvant 

(a) Bacterial Expression ofSubunit SARS Vaccines 

In one embodiment, bacterial host cells are used for recombinant expression of SARS vims 
proteins. Bacterial host cells suitable for use in the invention include, for example, E. coli, 
10 Bacillus subtilis, and Streptococcus spp. 

The SARS viral protein may be modified to facilitate bacterial recombinant expression. In - 
particular, the SARS spike protein may be modified to facilitate transport of the spike protein to 
the surface of the bacterial host cell. 

Applicants have discovered that there is strong structural homology between the SARS 
15 vims spike protein and the NadA protein of Neisseria meningitidis. Both proteins have an N- 
terminal globular "head" domain (amino acids 24-87), an intermediate alpha-helix region with 
high propensity to form coiled-coil structures (amino acids 88-350), and a C-tenninal membrane 
anchor domain formed by four amphipatix transmembrane beta strands (amino acids 351-405 of 
NadA). In addition, a leucine zipper motive is present within the coiled-coil segment See, 
JO FIGURE 19 depicting the SARS spike protein structure Comanducci et at , "NadA, a Novel 
Vaccine Candidate of Neisseria meningitidis", J. Exp. Med 195 (11): 1445-1454 (2002). In 
addition, a leucine zipper motif of NadA is present within the coiled-coil segment. The NadA 
protein also forms high molecular weight surface-exposed oligomers (corresponding to three or 
four monomers) anchored to meningococcal outer membrane. 
*5 When the NadA protein is expressed in E. coli y the full-length protein is assembled in 

oligomers anchored to the outer membrane of E. coli 9 similar to the way the protein is presented 
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in meningococcus. The NadA protein devoid of the predicted membrane anchor domain is then 
secreted into the culture supernatant This secreted protein is soluble and still organized in 
trimers. 

The invention therefore includes a fusion protein comprising an amino acid sequence of a 

5 SARS virus spike protein or a fragment thereof and a second amino acid sequence of a bacterial 
adhesion protein or a fragment thereof. Preferably, said adhesion protein is selected from the 
group consisting of NadA, YadA (of enteropathogenic Yersinia), and UspA2 (of Moraxella 
catarrhalis). Additional NadA-like proteins include serum resistance protein DsrA of 
Haemophilus ducreyi, the immunoglobulin binding proteins EibA, C, D, and F of K coli, outer 

.0 membrane protein 100 of Actinobacillus actinomycetemcomitans, the saa gene carried on the 
large virulence plasmid present in shiga toxigenic strains of E. coli (STEC), and each of the 
bacterial adhesion proteins described in U.K. Patent Application No. 0315022.4, filed on June 
26, 2003, each of which are specifically incorporated herein by reference. 

Preferably, said adhesion protein comprises NadA or a fragment thereof. 

L5 Such fusion proteins may be used to facilitate recombinant expression of immunogenic 

portions of SARS surface antigens, such as spike. These fusion constructs may also allow the 
SARS SI and/or S2 domains to adapt to a native confirmation. These fusion proteins are also 
able to oligomerize and form dimers or trimers, allowing the SI and/or S2 domains to associate 
and adapt conformations as in the native SARS spike protein. Further, these expression 

10 constructs facilitate surface exposure of the SARS spike protein. 

Hie fusion proteins of the invention preferably comprise a leader peptide from a NadA like 
protein, preferably NadA, a polypeptide from the immunogenic "head" region of the spike 
protein, and a stalk region from either the NadA like protein or the Spike protein. During 
expression and processing of the fusion protein, one or more amino acids may be cleaved off or 

25 removed, such as, Le. , the leader peptide or a membrane anchor domain. 

The stalk regions facilitate oligomerization of the expression protein. Optionally, the 
fusion proteins of the invention further include an anchor region of a NadA like protein. This 
anchor region allows the expression fusion protein to anchor and assemble on the bacterial cell 
surface. 

30 The fusion proteins of the invention include the following constructs: 

(i) the NadA leader peptide (optionally also including the first 6 amino acids of the mature 
NadA protein to facilitate processing of the leader peptide and appropriate maturation of the 
protein) followed by the Spike SI domain. Preferably, this construct comprises amino acids 1- 
29 of NadA (corresponding to the NadA leader peptide and the first 6 amino acids of the mature 

35 NadA protein, as shown in FIGURE 22 and as set forth below) followed by amino acids 14-662 
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of a S ARS virus Spike protein (conesponding to the S 1 domain, see FIGURE 1 9 and SEQ ID 
NO: 6042 and as set forth below). Specifically, construct (i) comprises SEQ ID NO: 7302. 

(ii) the NadA leader peptide (optionally also including the first 6 amino acids of the mature 
NadA protein to facilitate processing of the leader peptide and appropriate maturation of the 

5 protein) followed by the Spike SI domain, followed by the stalk and anchor membrane domains 
of NadA. Preferably, this construct comprises amino acids 1-29 of NadA (corresponding to the 
NadA leader peptide and the first 6 amino acids of the mature NadA protein, as shown in 
FIGURE 22 and as set forth below) followed by amino acids 14-662 of a S ARS virus Spike 
protein (corresponding to the SI domain, see FIGURE 19 and SEQ ID NO: 6042 and as set forth 

10 below) followed by amino acids 88-405 of NadA (corresponding to the stalk and the anchor 
membrane domains). Specifically, construct (ii) comprises SEQ ID NO: 7303. 

(iii) the NadA leader peptide (optionally also including the first 6 amino acids of the 
mature NadA protein) followed by a S ARS virus Spike S 1 domain, followed by the NadA stalk 
domain. Preferably, this construct comprises amino acids 1-29 of NadA followed by amino 

15 acids 14-662 of a SARS virus Spike protein (corresponding to the SI domain), followed by 
amino acids 88-350 of NadA (corresponding to the stalk domain). Specifically, construct (iii) - 
comprises SEQ ID NO: 7304. 

(iv) the NadA leader peptide (optionally also including the first 6 amino acids of the 
mature NadA protein), followed by a SARS virus Spike SI and S2 domain (excluding the 

20 putative transmembrane region), followed by the anchor domain of NadA. Preferably, this 
construct comprises amino acids 1-29 of NadA, followed by amino acids 14-1195 of a SARS 
virus Spike protein (corresponding to SI and S2, excluding the putative transmembrane region), 
followed by amino acids 351-405 of NadA (corresponding to the NadA anchor domain). 
Specifically, construct (iv) comprises SEQ ID NO: 7305. Alternatively, the NadA anchor 

25 domain may comprise amino acids 332 - 405 of NadA. 

(v) the NadA leader peptide (optionally also including the first 6 amino acids of the mature 
NadA protein), followed by a SARS virus Spike SI and S2 domain (exclusing the putative 
transmembrane region). Preferably, this construct comprises amino acids 1-29 of NadA, 
followed by amino acids 14-1195 of a SARS virus Spike protein. Specifically, construct (v) 

30 comprises SEQ ID NO: 7306. 

In each of constructs (i) to (v), the first 23 amino acids are the NadA leader peptide, and 
the GS dipeptide at residues 679-680 arises from the insertion of a restriction enzyme site. 

In constructs (i), (ii) and (iii), the NadA "head" is replaced by the Spike SI domain, and 
the fusion proteins are anchored to the outer membrane of E.coli or secreted in the culture 
35 supernatant, respectively. In constructs (iv) and (v), the "head" and "stalk" domains of NadA are 
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replaced by SI and S2 Spike domains; also in this case, the two fusion proteins are anchored to 
the outer membrane of Kcoli or secreted in the culture supernatant, respectively. 

Accordingly, the invention further includes a fusion protein comprising an amino acid 
sequence of a SARS virus spike protein or a fragment thereof and a second amino acid sequence 

5 of a bacterial adhesion protein or a fragment thereof. Preferably, amino acids corresponding to 
the "head" of the adhesion protein are replaced by amino acids corresponding to a SARS virus 
Spike SI domain. Alternatively, the amino acids corresponding to the tc head" and "stalk" 
domains of the bacterial adhesion protein are replaced by amino acids corresponding to the 
SARS virus spike protein SI and S2 domains. 

10 As discussed above and shown in Figure 19, the S 1 domain of the Spike protein is 

identified as the globular receptor binding "head" region. The SI domain of the Spike protein 
preferably comprises about amino acids 14-662 of SEQ ID NO: 6042. The SI domain may 
comprise a shorter amino acid sequence, wherein amino acids are removed from either the N- 
terminal or C-terminal regions. Preferably, 3, 5, 7, 9, 13, 15, 20 or 25 amino acids are removed 

15 from either the N-terminal or C-terminal regions. The SI domain further includes amino acid 
sequences having sequence identity to the SI region of SEQ ID NO: 6042. An example of the 
SI domain is SEQ ID NO: 7307: 

As discussed above and shown in Figure 19, the S2 domain of the Spike protein is 
identified as the "stalk" region. The "stalk** region comprises oligomerization domain regions, a 

20 . leucine zipper domain regions, membrane anchor regions, hydrophobic domain regions, cystein- 
rich domain region and a cytoplasmic tail region. The S2 domain of the Spike protein preferably 
exclujdes the transmembrane region and comprises about amino acids 663-1 195 of SEQ ID NO: 
6042. The S2 domain may comprise a shorter amino acid sequence, wherein amino acids are 
removed from either the N-terminal or C-terminal regions. Preferably, 3, 5, 7, 9, 13, 15, 20 or 25 

25 amino acids are removed from either the N-terminal or C-terminal regions. The S2 domain 

further includes amino acid sequences having sequence identity to the S2 region of SEQ ID NO: 
6042. An example of the SI domain (with the transmembrane region excluded) is SEQ ID NO: 
7308. 

An example of the NadA protein described above is SEQ ID NO: 7309. As discussed 
30 above, the leader sequence of NadA used in the fusion protein preferably comprises about the 
first 29 amino acids of NadA (including a leader sequence with about 6 amino acids of the NadA 
head protein). Examples of such a leader sequences are set forth as SEQ ID NOS: 7310 and 
731 1 below. The fusion protein may use a leader sequence comprising a shorter amino acid 
sequence, wherein amino acids are removed from either the N-terminal or C-terminal regions. 
$5 Preferably, 1, 2, 3, 4, or 5 amino acids are removed from either the N-terminal or C-terminal end 
of the sequence. The leader sequence used in the fusion protein may also include an amino acid 
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sequences having sequence identity to SEQ ID NO: 7310 or SEQ ID NO: 7311. Preferably, the 
leader sequence comprises SEQ ID NO: 7311. 

Optionally, the fusion peptide comprises about die first 6 amino acids of the mature NadA 
protein to facilitate processing of the leader peptide and appropriate maturation of the protein. 

5 An examples of the first 6 amino acids of a mature NadA proteins is SEQ ID NO: 7312.. 

As discussed above, the stalk and anchor sequences of NadA used in the fusion protein 
preferably comprise about amino acids 88-405 of NadA. An example of an amino acid sequence 
comprising NadA stalk and anchor regions is set forth below as SEQ ID NO: 7313 below. An 
example of an amino acid sequence comprising a NadA stalk region (without the anchor region) 

10 is set forth as SEQ ID NO: 7314 below. An example of an amino acid sequence comprising a 
NadA anchor region is set forth as SEQ ID NO: 7315 below. The fusion protein may use a stalk 
(and/or anchor) sequence comprising a shorter amino acid sequence, wherein amino acids are 
removed from either the N-tenninal or C-terminal regions. Preferably, 1, 2, 3, 4, 5, 6, 7* 8 or 9 
amino acids are removed from either the N-terminal or C-terminal end of the sequence. The 

L5 leader sequence used in the fusion protein may also include an amino acid sequences having 
sequence identity to the SEQ ID NO: 7313. - 

The fusion proteins of the invention, including those described above, may be prepared, for 
example, as follows. Single fragments (such as the regions described above) may be amplified 
by PCR using the oligonucleotide primers set forth in the Table below. (S 1l refers to the Spike 

20 protein fused to the leader peptide of NadA; S2 refers to the stalk region of the Spike protein, 
with and without the stop codon). Hie oligonucleotides were designed on the basis of the DNA 
sequence of NadA from N. meningitidis B 2996 strain and of Spike from SARS virus isolate 
FRA1. Each oligonucleotide includes a restriction site as a tail in order to direct the cloning into 
the expression vector pET21b. 



SEQ ID NO: Restriction site 



S1l 


For 


7316 


Ndel 


S1l 


Rev 


7317 


BamHI 


S2 


For 


7318 


BamHI 


S2 


Rev 


7319 


HindDI 


S2-stop 


Rev 


7320 


Xhol 


NadAss 


For 


7321 


BamHI 


NadAsso 


Rev 


7322 


Xhol 


NadA3 32 


For 


7323 


HindlH 


NadA^ 


Rev 


7324 


Xhol 



25 

The single fragments are sequentially cloned into pET21b vector, in order to express the 
proteins under the control of inducible T7 promoter. The SI domain of the Spike protein fused 
to the leader peptide of NadA (S1l) was obtained by PCR using the primers SlL-For and S1l- 
Rev. The forward oligonucleotide primer contains the Ndel restriction sequence and the 
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sequence coding for the leader peptide of NadA plus the first 6 aminoacids of the mature protein. 
The PCR fragment was cloned as a Ndel/BamHI fragment in the pET21b vector opened with the 
same restriction enzymes. This clone (pET-SU) was then used to sequentially clone the other 
different domains, as BamEQ/XhoI, BamHI/Hindin or Hindm/Xhol fragments. BamHI and 

5 HindTTT restriction sites introduce the aminoacids GS and KL, respectively. 

Hie PCR amplification protocol was as follows: 200ng of genomic DNA from Neisseria 
meningitidis 2996 or 10 ng of plasmid DNA preparation (plasmid pCMVnew, containing the 
entire gene coding of the Spike protein), were used as template in the presence of 40 (iM of each 
oligonucletide primer, 400-800 \\M dNTPs solution, lx PCR buffer (including 1.5mM MgCl 2 ), 

10 2.5 units TaqI DNA polymerase (using Perkin-Elmer AmpliTaQ or Invitrogen Platinum Pfx 
DNA polymerase). 

After a preliminary 3 minute incubation of the whole mix at 95°C, each sample underwent 
a two-step amplification: the first 5 cycles were performed using the hybridisation temperature - 
that excluded the restriction enzyme tail of the primer (Tml). This was followed by 30 cycles 
15 according to the hybridisation temperature calculated for the whole length oligos (Tm2); 

Elongation times, performed at 68°C or 72°C, varied according to the length of the fragment to 
be amplified/The cycles were completed with a 10 minute extension step at 68°C or 72°C. 

The amplified DNA was either loaded directly on agarose gel and the DNA fragment 
corresponding to the band of correct size was purified from the gel using the Qiagen™ Gel 
50 Extraction Kit, following the manufacturer's protocol. 

The purified DNA corresponding to the amplified fragment and the plasmid vectors' were 
digested with the appropriate restriction enzymes, purified using the QIAquick™ PCR 
purification kit (following the manufacturer's instructions) and ligation reactions were . 
performed. 

5 The ligation products were transformed into competent E. coli DH5a and screening for 

recombinant clones was performed by growing randomly-selected colonies and extracting the 
plasmid DNA using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's 
instructions. 

Recombinant plasmids were introduced into E. coli BL21(DE3) used as expression host 
0 Single recombinant colonies were inoculated into LB + ampicillin and incubated at 37°C for 14- 
16 h. Bacteria were directly recovered by centrifugation (uninduced conditions) or diluted in 
fresh medium and grown at 37°C until ODeoo between 0.4-0.8. Protein expression was induced 
by addition of 1 mM Isopropyl-l-thio-P-D-galactopyranoside (IPTG) for three hours (induced 
conditions). 

5 Whole cell lysates were obtained resuspending bacteria in SDS-sample buffer IX and 

boiling for 5-10 min. Equal amounts of proteins were separated using NuPAGE (Invitrogen) or 
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BIORAD Gel System, according to the manufacturer's instructions. Proteins were revealed by 
Coomassie-blue staining or transferred onto nitrocellulose membranes for western blot analysis. 
Western blot was performed using a rabbit polyclonal anti-serum against purified NadA^i^cs 
(diluted 1:3000) and a secondary peroxidase-conjugate antibody (DAKO). 
5 Results of the expression in Kcoli of Sl u Sl L -NadA and SlL-NadAAmchor are shown in 

FIGURES 38 and 39. Schematics of the fusion constructs are shown in FIGURE 37. 

Bacterial expression of the SARS viral antigens may also be used to prepare compositions 
comprising outer membrane vesicles wherein said outer membrane vesicles comprise one or 
more SARS viral antigens. 

10 Outer Membrane Vesicles ("OMV"), also referred to as blebs, refer to vesicles formed or 

derived from fragments of the outer membrane of a Gram negative bacterium. OMVs typically 
comprise outer membrane proteins (OMPs), lipids, phospholipids, periplasmic material and 
lipopolysaccharide (LPS). Gram negative bacteria often shed OMVs during virulent infections 
in a process known as Webbing. OMVs can also be obtained from Gram negative bacteria via a 

15 number of chemical denaturation processes, such as detergent extraction. Synthetic OMVs or 
liposomes, comprising a lipid bilayer and typically enclosing an aqueous core, can also be 
prepared with the SARS viral antigens of the invention. 

The OMVs of the invention are preferably lipid vesicles comprising a lipid bilayer 
surrounding an aquous core. Typically the lipid vesicles are of unilamellar structure (Le., a 

£0 single lipid bilayer surrounds the aquous core), although multilammelar lipid vesicles may also 
be used in the compositions of the invention. OMVs typically have sizes in the nanomolar to 
micromolar range, e.g., from 1 nM to 100 jiM, more typically from lOnM to 10 /iM and 
preferably from 30 nM to 1 /iM. 

The OMVs of the invention are preferably prepared from gram negative bacteria. Gram 

55 negative bacteria are those bacteria that fail to resist decolonization in the commonly known 
Gram staining method. Gram negative bacteria are characterized by a complex multilater cell 
wall and often possess an outer layer polysaccharide capsule. Gram negative bacteria suitable 
for producing OMVs include, for example, species from Neisseria, Moraxella, Kingella, 
Acinetobacter, Brucella, Bordetella, Chlamydia, Porphyromonas, Actinobacillus, Borelia, 

i0 Serratia, Campylobacter, Helicobacter, Haemophilus, Escherichia, Legionella, Salmonella, 
Pseudomonas and Yersinia. 

The OMVs of the invention preferably comprise one or more SARS viral antigens or a 
fragment thereof. The SARS viral antigens may be recombinantly expressed in a Gram negative 
bacterial host cell and then harvested with the OMV. 

5 Antigenic components, such as recombinantly expressed SARS viral antigens, may be 

located in any or all of the three main compartments of the lipid vesicles, including attached to 
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either the interior or exterior surface of the lipid vesicle, for example via a membrane anchor 
domain, or attachment to a lipid moiety; inserted into the lipid bilayer, for example where the 
antigenic component is itself a hydrophobic or lipid based entity, or located within the aqueous 
center or core of the lipid vesicle. 

Synthetically prepared OMVs, or liposomes, may be used in the invention. Such 
liposomes may comprise a number of different lipids and fatty acids. Suitable lipids for 
inclusion in liposomes of the invention include but are not limited to phophatidylinositol-(4,5)- 
diphosphate, phosphatidylserine, phosphatidylcholine, phosphatidylethanolamine, 
phosphatidyglycerol, cholesterol, beta-oleolyl-gamma-palmitoyl, lipopolysaccharides and 
galactocerbrosides. 

Suitable means for extraction of OMVs from bacterial sources include deoxycholate 
extraction, Tris/HCl/EDTA extraction, and lithium acetate extraction. Preferably, the extraction 
process comprises a physical and/or chemical means to disrupt the bacterial cell outer membrane 
in order to release sufficient OMVs for purification and isolation. See, e.g., WO 03/051379. 

The OMVs of the invention may be enriched and/or supplemented with antigenic 
components, such as SARS viral antigens, by methods known in the art, including, for example, 
direct combination in vitro where an energetic combination step can optionally be applied to 
facilitate integration of the antigenic component into a compartment of the liposome. Methods 
of energetic combination suitable for use in the invention include homogenization, 
ultrasonication, extrusion, and combinations thereof. 

Preferably, the antigenic component, such as the SARS viral antigen, is recombinantly 
produced by the host cell from which the OMV is derived In one embodiment, such OMVs are 
prepared by introducing nucleic acid sequence encoding for the SARS viral antigen into the 
recombinant host cell. Preferably the nucleic acid sequence encoding for the SARS viral antigen 
is controlled by a strong promoter sequence. Preferably, the nucleic acid sequence encoding the 
SARS viral antigen further comprises an outer-membrane targeting signal. For example, the 
nucleic acid sequence encoding the SARS viral antigen may be fused to a sequence encoding for 
a naturally occurring outer membrane protein of the bacterial host Preferably, the nucleic acid 
sequence encoding the SARS viral antigen is fused to the signal peptide sequence of the 
naturally occurring outer membrane protein of the bacterial host 

Methods of preparing an optimizing OMVs for use in vaccines are disclosed in, for 
example Klip et ai, J. Bad. (1973) 115: 717-722; Davies etal. 9 J. Immunol Method (1990) 
143:215-225; and WO 01709350. 

In one embodiment, a bacterial host cell, such as E. coli, are transformed to express the 
SARS spike protein. As discussed above, the spike protein may be modified to facilitate 
bacterial expression and transport of the spike protein to the surface of the host cell. Each of the 
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Spike/NadA fusion constructs discussed above may be used in the OMV preparations of the 
invention. Preferably, constructs comprising the spike SI globular head domain fused to the 
stalk region of NadA are used to generate OMVs. The construct may optionally include the 
NadA leader peptide as well as the NadA anchor peptide. Schematic diagrams of these preferred 
OMV constructs are depicted in FIGURE 49. 

Example 6 describes one method of preparing the OMVs of the invention. 
(b) Mammalian Expression ofSubunit SARS Vaccine 

As discussed above, mammalian host cells may be used for recombinant expression of 
S ARS virus proteins. Mammalian host cells suitable for use in the invention include, for 
example, Chinese hamster ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) cells, 
monkey kidney cells (e.g., Hep G2), Madin-Darby bovine kidney ("MDBK") cells, as well as 
others. Mammalian sources of cells include, but are not limited to, human or non-human primate 
(e.g,, MRC-5 (ATCC CCL-171), Wl-38 (ATCC CCL-75), human embryonic kidney cells (293 
cells, typically transformed by sheared adenovirus type 5 DNA), VERO cells from monkey 
kidneys (including, for example COS7 cells), horse, cow (e.g., MDBK cells), sheep, dog (e.g., 
MDCK cells from dog kidneys, ATCC CCL34 MDCK (NBL2) or MDCK 33016, deposit 
number DSMACC 2219 as described in WO 97/37001), cat, and rodent {e.g., hamster cells such 
as BHK21-F, HKCC cells, or Chinese hamster ovary cells (CHO cells)), and may be obtained 
from a wide variety of developmental stages, including for example, adult, neonatal, fetal, and 
embryo. 

The polynucleotides encoding the SARS viral proteins may be modified to facilitate or 
enhance expression. For example, commercial leader sequences known in the art, such as tPA or 
IgK or interleukin-2, may be used in the recombinant constructs. Preferably, however, the 
natural SARS leader sequence is used. Use of the natural leader sequence can be used to ensure 
that the protein will be trafficked in human cells in the same way as during a normal viral 
infection, which may be advantageous e.g. for DNA vaccines, where antigen is expressed in situ. 

As discussed above, tag sequences can be used in the expression constructs to facilitate 
purification, detection and stability of the expressed protein. Tag proteins suitable for use in the 
invention include a polyarginine tag (Arg-tag), polyhistidine tag (His-tag), FLAG-tag, Strep-tag, 
c-myc-tag, S-tag, calmodimn-binding peptide, ceUulose-binding domain, SBP-tag„ chitin- 
binding domain, glutathione S-transferase-tag (GST), maltose-binding protein, transcription 
termination anti-terminiantion factor (NusA), K coli thioredoxin (TrxA) and protein disulfide 
isomeraseI(DsbA). Preferred tag proteins include His-tag and GST. A full discussion on the 
use of tag proteins can be found at Terpe et aL, "Overview of tag protein fusions: from 
molecular and biochemical fundamentals to commercial systems", Appl Microbiol Biotechnol 
(2003) 60:523-533. 
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After purification, the tag proteins may optionally be removed from the expressed fusion 
protein, ie., by specifically tailored enzymatic treatments known in the art Commonly used 
proteases include enterokinase, tobacco etch virus (TEV), thrombin, and factor X*. 

One or more amino acid sequences or amino acid domains of the spike protein may be 

removed to facilitate mammalian recombinant expression. For instance, the entire S2 domain or 

the spike transmembrane region may be removed Representative examples of some expression 

constructs of both full length and truncated spike glycoprotein suitable for mammalian 

expression are shown in FIGURE 40. Polynucleotide sequences representing each construct are 

shown in SEQ ID NOS 6578-6583. A description of each annotation is shown below: 

Clone Name Description Expression Construct 

nSh natural leader sequence SEQ ID NO: 6578 

full length Spike 

histidine tag 

nS natural leader sequence SEQ ID NO: 6579 

full length Spike 

nShATC natural leader sequence SEQ ID NO: 6580 

Spike without transmembrane sequence 
histidine tag 

nSATC natural leader sequence SEQ ID NO: 6581 

Spike without transmembrane sequence 
nSlh natural leader sequence SEQ ID NO: 6582 

SI domain 

histidine tag 

nSl natural leader sequence SEQ ID NO: 6583 

SI domain 



Cloned cDNA fragments that encompass full-length Spike coding sequences, as well as a 
Spike construct deleted of the transmembrane and cytoplasmic domains (TM-Cy-deleted Spike) 
for secretion were inserted into an expression vector pCMVm to create nSh and nShATC, 
respectively. Both spike proteins were tagged with six histidine residues at the end of C- 

.5 terminus to aid initial characterization of the expressed spike proteins. Similar sequences 

encoding full-length Spike or transmembrane and cytoplasmic domain deleted Spike, but without 
the histidine "tag" are readily substituted by one of skill in the art 

The likely locations of the expressed spike constructs was assessed by separating expressed 
proteins into an aqueous fraction (AF) and a detergent fraction (DF) using the procedure shown 

,0 in figure 48, with results of western blot analysis shown in figure 43. The above described 

vector constructs were evaluated for expression after transfection into COS7 cells. The construct 
expressing the full length spike protein remained in the cell membrane while the construct 
expressing the truncated spike protein was located either in the cytosol (figure 43) or secreted 
into the cell medium (Figure 44). As shown in Figure 43, full-length spike protein is found in DF 

5 (membrane) in an aggregated form, while the truncated protein is found in AF (cytosol) as a 
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monomer. As shown in Figure 44, deleted proteins (ShATQ are secreted, and a small fraction of 
full-length spike protein is detected in the medium by rabbit serum. 

Recombinant^ expressed spike proteins may be oligomerized. When the spike proteins 
are to be used in a vaccine or to generate antibodies specific to the spike protein, they are 
5 preferably oligomerized. In order to obtain oligomerized spike protein, it is preferred to 
maintain the transmembrane domain in the recombinant expression construct Fbr example, 
FIGURE 41 illustrates a western blot of COS7 cell lysates comparing expressed nSh and 
nShATC using both anti-his tag and rabbit anti-S ARS antibodies. As shown full-length (nSh) 
aggregates, but the truncated (nShATC) spike protein does not Antibody raised against the His- 
10 tagged protein recognizes full-length and truncated spike proteins in native and reduced forms. 
Rabbit antiserum recognizes spike protein only in non-reducing conditions. Spike aggregates or 
oligomers were present in larger amounts in the cell lysates from the expressed nSh constructs. 
Preferably, the oligomerized spike proteins form a homotrimer, as indicated in FIGURE 47 

A further experiment, illustrated in FIGURE 42, demonstrates that the oligomerization of 
15 the expressed nSh constructs is likely due to a non-covalent linkage (and is likely not due to, for 
example, a disulfide bond). The oligomer dissociates into monomers at elevated temperature 
(80-100°C), but is stable in reducing conditions if not heated. 

It is further preferred that recombinantly expressed spike proteins are glycoslyated. 
Tunicamycin and glycosidases were used to assess glycosylation. FIGURE 45 illustrates that 
50 glycoslation of expressed spike proteins is not affected by removal of the transmembrane domain 
region. Both full-length (Sh) and truncated (ShATC) S ARS spike proteins are glycosylated. 

Preferably, expression of the constructs of the invention is not toxic to the mammalian host 
cell. FIGURE 46 demonstrates that expression of the illustrated spike constructs is not toxic to 
the COS7 host cell. 

:5 Methods for transfecting, expressing, culturing, isolating and purifying recombinant 

proteins from mammalian cell cultures are known in the art For example, the SARS spike 
constructs of the invention may be expressed in 293 cells. These cells may be cultured and 
transfected in static or monolayer cultures. For rapid large-scale production of SARS protein 
antigens in sufficient quantities for in vitro and in vivo evaluation, including immunogenicity 

0 studies, large-scale transient transfection of 293 (human embryonic kidney) cells may be used to 
obtain milligram quantities of the recombinant antigen(s). Alternatively, larger scale transfection 
of these cells may be performed with 293 cells in suspension culture. Preferably, the expressed 
SARS proteins are harvested from the transfected cells between 48 and 72 hours after 
transfection or even from 72 to 96 or more hours after transfection. 

5 Where the host cells are transfected with truncated spike expression constructs, the 

expressed spike protein is secreted from the host cells and collected from the cell media. After 
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concentration, the spike protein may be purified from the media using, for example, GNA lectin 
followed by DEAE and ceramic hydroxyaparite column chromatography. . 

Where the host cells are transfected with full length spike expression constructs, but rather 
is retained within the cells, and may be purified from triton X-100 detergent extracted cells. The 
5 full-length Spike protein can then be captured on GNA lectin, followed by hydroxyaparite and 
SP chromatography. 

Chinese Hamster Ovary (CHO) or other eukaryotic (e.g., mammalian) cells that stably 

express the SARS viral antigens of the invention may also be derived (e.g. Figure 73). 

Preferably, the cells are CHO cells, and these constructs will comprise one or more marker or 
.0 selection genes in order to select for the desired CHO cells. In one embodiment, the constructs 

comprise a CMV enhancer/promoter, ampicillin resistance gene, and a fused DHFR and 

attenuated neomycin gene for selection purposes. Stable cell lines can then be produced using 

the neomycin selection system in CHOK-1 cells. Selected clones can then be sequenced to 

verify the integrity of the insert, and transient transections can men be performed using Trans- 
5 LT1 polyamine transfection reagent (PanVera Corp., Madison, WI) to assessthe expression level 

and also the integrity of the expressed protein by ELESA and western blot analysis. 

Methods for derivation of CHO cells stably expressing the SARS viral antigens of the 

invention comprise the steps of transfection and primary screening with selective medium. 

Optionally, these steps are followed by subcloning to assure purity of cell lines. Cell culture 
0 supernatants can be assayed using an antigen capture ELISA to quantify expression levels at all 

stages of selection and amplification. 

• For full-length Spike expression constructs, methanol fixed cells can be screened for 

internal expression by immunofluorescent staining using a rabbit anti-SARS antibody. 

Successive measurements at the T75-flask stage of expansion can be employed to assure stability 
5 of expression levels. The molecular mass and integrity of the expressed proteins can be checked 

by PAGE both under native and reducing and denaturing conditions, followed by 

immune-probing. 

In one embodiment, the pCMV3 vectors expressing SARS-CoV Spike proteins in either 
full-length or truncated forms is introduced into CHOK-1 cells using the Trans-LT-1 reagent 
) On day one, lxlO 6 cells are plated on 100 mm dishes in non-selective F12 media + 10% Fetal 
Bovine Serum + 4 mM Glutamine. On day two, the cells are transfected with a DNA:LT-1 
mixture and the media then replaced with complete F12 media. Twenty-four to forty-eight hours 
later depending on the cell density, each 100 mm dish is split to 4-6 100 mm dishes. The 
medium is changed to complete selective media containing Geneticin (neomycin) at 500 jig/ml. 
All bovine serum used in these procedures is from TSE-ftee sources that meet current FDA 
standards. Twenty-four hours later the medium is changed to complete selective medium plus 

-137- 



WO 2004/092360 



PCT/US2004/011710 



500 ug/ml neomycin. Ten to fourteen days later, individual colonies are picked and transferred 
to 96 well plates and cultured in complete selective medium but without G418. When 
approximately 80% of the wells are confluent, twenty-four hour supernatants are screened by 
spike capture EJJSA positive clones are transferred to twenty-four well plates. For the initial 
5 expression of full length Spike protein, methanol fixed cells will be screened by 

immunoflourescent staining using a rabbit anti-S ARS antibody. After the low expressing cell 
lines have been eliminated and there are less than 20-30 cell lines, capture ELBA and westerns 
will be used to determine the expression level after cell lysis. A portion of each cell line will be 
pelleted, weighed and lysed in 1% triton lysis buffer containing MOPS, NaCl and MgCk at the 

10 same ratio of cell weight to lysis buffer. After lysis the supernatant is collected and expression 
level is determined Three to four clones producing the highest levels of spike protein in correct 
structure and conformation will be grown in three-liter bioreactors for expansion and adaptation 
to low serum suspension culture conditions for scale-up. 

The antigen capture ELIS A assay for the SARS spike protein can be performed as 

15 described in the art A brief description of this assay follows. 96 well flat-bottom plates ; 
(Corning, Corning, NY) are coated with 250ng per well of purified immunoglobulin obtained 
from rabbit sera that were immunized with inactivated SARS virus. Between steps, the plates 
are washed in a buffer containing 16%NaCl and 1% Triton X100. IOOjiL of supernatant or 
lysate samples (diluted in a buffer containing lOOmM NaP0 4 , 0.1% Casein, ImM EDTA, 1% 

10 Triton X100, 0.5M NaCl and 0.01% Thiomersal, pH 7.5) are added and incubated for 2 hours at 
37°C. Bound antigen is reacted against pooled SARS+ve serum or high affinity monoclonal 
antibody either human or mouse against SARS spike protein (1 hour incubation, 37°C) and 
detected using appropriate species-specific peroxidase conjugated second antibody (30 minute 
incubation at 37°C; TAGO, Burlingame, CA). The plates are developed for 15 minutes at room 

55 temperature using 1MB substrate (Pierce, Rockford, IL) and the reaction stopped using 4N 
phosphoric acid. The plates are read at a wavelength of 450nm and the concentration of protein 
per ml sample is derived from a standard curve (OD vs. protein concentration) based on serial 
dilutions of a known concentration of recombinant spike protein. 

The immunoprobing analysis can also be performed following the standard methods 

0 described elsewhere in the art. A brief description follows. 10-20 \A of the sample is analyzed 
on 4-20% SDS PAGE under non-reducing/ denaturing conditions with mild heating. The gels 
are run for 1.5-2.0 hours at 100 V constant voltage. The proteins are then transferred onto 
nitrocellulose membranes (Millipore, Bedford, MA) for 45 min using the semidry western 
transfer system (BioRad, Hercules, CA) following the manufacturer's instructions. The 

5 membrane is then reacted against polyclonal anti-spike rabbit serum, followed by anti-rabbit Ig 
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conjugated to Alexa 688 (Molecular Probes, Oregon). The blots are scanned using an infrared 
imaging system (Ll-Cor, Inc., Lincoln, Nebraska). 

The highest expressing candidate cell lines can be screened for spike protein expression 
and stability in small-scale (3 liter) suspension cultures. The candidate clone can be further 
evaluated for level of expression as well as integrity of expressed protein after amplification, and 
subsequendy tested for expression stability in the absence of selection. The selected clones can 
also be tested for maintenance of the DNA sequence integrity of the integrated SARS spike 
protein gene. To quickly monitor the expression levels in small flask (T25 or 175) and in the 
three liter evaluation cultures, a lectin-based process (Gluvanthus Nivalis lectin) may be used to 
isolate SARS spike protein to a degree of purity that allows semi-quantitation and 
characterization of the protein in CHO supernatant For full-length spike protein, it will be 
obtained from triton X-100 detergent extracted cells. Full-length Spike protein will be then 
captured on GNA lectin, followed by hydroxyapatite and SP chromatograph. FJuted protein is 
then characterized by: 1) polyacrylamide gel electrophoresis (PAGE) and Coomassie staining, 2) 
Immunoprobing with anti-SARS rabbit sera, 3) structural characterization using size exclusion 
chromatography (SEC), as well as mass spec analysis using MALDI-TOF. 

Routes and methods of immunization of the vaccines of the invention are discussed in 
more detail in a section below. Examples 7 to 9 illustrate sample immunization protocols for the 
recombinant spike proteins. 

Vaccine testing ' 

Prior to human administration, it is normal to test vaccines in animal models. A mouse 
model of SARS coronavirus infection is known (Subbarao et al. (2004) J Virol 78:3572-77), and 
other animals that may be used as models of infection and/or disease include ferrets and 
monkeys. Thus the invention provides a non-human animal that is infected by the SARS 
coronavirus, wherein the animal is preferably a ferret or a primate (e.g. a monkey or a macaque). 
The animal may be gnotobiotic. The animal is preferably not a cat (Felis domesticus). The 
animal may or may not display SARS disease symptoms e.g. ferrets (Mustela furd) show 
prominent pulmonary pathology after infection. See: Martina et al. (2003) Nature 425:915. 
E. Polynucleotides tmmdi ne the SARS Antigens nftiw Immtinr, 

The invention includes polynucleotides encoding for the SARS antigens of the invention. 
In addition, the invention includes polynucleotides which have been optimized for recombinant 
production (e.g. codon optimization) of the SARS antigens of the invention, including 
polynucleotides encoding for each of the SARS fusion constructs discussed above. 



-139- 



WO 2004/092360 



PCTAJS2004/011710 



F. Viral vector or Viral Particle delivery of the SARS Antigens of the Invention 

The antigens of the invention may be expressed in vivo or in vitro by polynucleotides 
encoding the antigens. Expression and delivery of the polynucleotides of the invention may be 
facilitated via viral vectors and/or viral particles. 
5 Gene-based delivery systems derived from viruses, such as alphaviruses, are useful for the 

ex vivo and in vivo administration of heterologous genes, including one or more SARS genes, 
having therapeutic or prophylactic applications. These systems can also be used for the 
production of recombinant proteins derived from the SARS virus in cultured cells. Gene-based 
delivery systems of the invention include viral vectors (e.g., adenovirus vector, poxvirus vector, 
10 alphavims vector) and non-viral nucleic acid vectors (e.g., DNA, RNA) encoding one or more 
SARS virus antigens. Polynucleotides encoding SARS virus antigen(s) are incorporated into the 
gene-based vaccines individually or in combination (e.g., as bicistronic constructs). 
1. Alphavirus 

Alphaviruses are members of Togaviridae family and share common structural and 

15 rephcative properties. Sindbis virus (SIN) is the prototype virusforlhemolecular study of other 
alphaviruses, and together with Venezuelan equine encephalitis virus (VEE) and Semliki Forest 
virus (SFV), are the most widely utilized alphaviruses being developed into expression vectors 
for heterologous genes (Schlesinger and Dubensky (1999) Curr Opin Biotechnol. i0:434-439; 
Schlesinger (2001) Expert Opin. Biol. Ther. 1:177-91). 

*) Alphaviruses possess a relatively small single-stranded RNA genome of positive polarity, - 

which is approximately 12 kb in length, capped and polyadenylated. The RNA interacts with 
viral capsid protein monomers to form nucleocapsids, which in turn, are surrounded by ; a host 
cell-derived lipid envelope from which two viral glycoproteins, El and E2, protrude forming 
"spike" trimers of heterodimeric subunits. Two open reading frames (ORFs) encode as 

5 polyproteins the enzymatic nonstructural replicase proteins (5' ORE) and the virion structural 
proteins (3* ORE). Hie structural polyprotein is translated from a highly abundant subgenomic 
mRNA, which is transcribed from a strong internal alphavirus promoter (Strauss and Strauss 
(1994) Microbiol Rev. 58:491-562). Replication of the genome occurs exclusively within the 
host cell cytoplasm as RNA. 

3 The most common alphavirus expression vectors have exploited both the positive-stranded 

nature and modular organization of the RNA genome. These vectors, termed "replicons" due to 
their property of self-amplification, permit insertion of heterologous sequences in place of the 
structural polyprotein genes, while maintaining the 5*- and 3'-end cis replication signals, the 
nonstructural replicase genes, and the subgenomic junction region promoter (Xiong et al. (1989) 

> Science 245:1188-1191; Liljestrom (1991) Bio/Technology 9:1356-1361). Chimeric alphavirus 

vectors (and particles) from sequences derived from divergent virus families have also been 

-140- 



WO 2004/092360 PCT/US2004/011710 

described, (see, for example United States patent application serial number 09/236,140; see also, 
US Patents 5,789,245, 5,842,723, 5,789,245, 5,842,723, and 6,015,694; as well as WO 95/07994, 
WO 97/38087 and WO 99/18226). Co-owned International Publication WO 02/099035, 
published December 12, 2002 and incorporated by reference in its entirety herein, describes 
chimeric alphavirus molecules and modified alphavirus molecules having modified Biosafety 
Levels. 

The absence of structural protein genes renders alphavirus replicon vectors defective, in 
that RNA amplification and high-level heterologous gene expression occurs within the target 
cell, but cell-to-cell spread of vector is not possible due to the inability to form progeny virions. 
Through the years, several synonymous terms have emerged that are used to describe alphavirus 
replicon particles. These terms include recombinant viral particle, recombinant alphavirus 
particle, alphavirus replicon particle and replicon particle. However, as used herein, these terms 
all refer to a virion-like unit containing an alphavirus-derived RNA vector replicon. Moreover, 
these terms may be referred to collectively as vectors, vector constructs or gene delivery vectors. 

Packaging of replicon RNA into particles can be accomplished by introducing the replicon 
RNA into permissive cells (e.g„ RNA or DNA transection, or particle infection) that also 
contain one or more structural protein expression cassettes or "defective helper" constructs 
encoding the alphavirus structural proteins. These structural protein encoding constructs may 
themselves be introduced into the cells by transfection of either RNA or DNA, and most 
commonly retain the native alphavirus subgenomic promoter, as well as 5'- and 3'-end cis 
signals for co-amplification with the replicon, but are devoid of any rephcase genes and the RNA 
packaging signal (Liljestrom (1991) Bio/Technology 9:1356-1361; Pushko et aL (1997) Virology 
239:389^01; Polo et aL (1999) PNAS 96:4598-4603). Permanent cell lines that are stable 
transformed with constructs expressing the alphavirus structural proteins {e.g., packaging cell 
lines) offer a means to avoid transient transfection production methods (Polo et aL (1999) PNAS 
96:4598-4603). 

The present invention includes compositions and methods for the production of replication 
defective viral vector particles (e.g., alphavirus replicon particles) for use in the ex vivo and in 
vivo administration of heterologous genes encoding proteins having therapeutic or prophylactic 
application, including genes encoding for one or more SARS viral antigens. 

In one aspect, the invention includes a method of producing replication defective viral 
vector particles (e.g, alphavirus replicon particles) comprising the steps of introducing at least 
one nucleic acid molecule comprising a viral vector (e.g., alphavirus replicon RNA) into 
immortalized cells of the present invention, under conditions that allow for complementation of 
the viral vector {e.g., alphavirus replicon RNA) and production of viral vector particles (e.g., 
alphavirus replicon particles), and isolating the viral vector particles from the cells or cell culture 
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supematants. In certain embodiments, the immortalized cells are grown in suspension, for 
example PERC.6 cells. In other embodiments, the methods are performed in large-scale 
volumes, for example, liter volumes or greater, such as for example in roller bottles, large flasks, . 
Nunc Cell Factories, Corning Cell Cubes, fermentation vessels, etc). 
5 In certain embodiments, the viral vector is an alphavirus replicon RNA that requires 

complementation by providing one or more alphavirus structural proteins in trans, within the 
immortalized cell. In such instances, the methods of complementation to produce alphavirus 
replicon particles may involve the introduction of one or more nucleic acids (e.g., RNA, DNA) 
encoding said alphavirus structural protein(s) (eg., capsid and/or envelope glycoproteins) into 
10 the immortalized cells, either transiently or stably, and either concurrent with or prior to the 
introduction of the alphavirus replicon RNA. In certainembodiments, the alphavirus replicon 
RNA is introduced into the cell by transfection an in vitro transcribed RNA. In other 
embodiments, the alphavirus replicon RNA is introduced into the cell by transfection of a DNA 
(e.g., ELVIS), which is capable of transcribing within the cell, the replicon RNA. In yet other 
L5 embodiments, the alphavirus replicon RNA is introduced into the cell by infection with a seed 
stock of alphavirus replicon particles. In certain embodiments, the nucleic acids encoding said 
alphavirus structural protein(s) are defective helper RNA or are DNA that can transcribe within 
the cell defective helper RNAs. 

As discussed herein, "alphavirus RNA replicon vector", "RNA replicon vector", "replicon 
0 vector" or "replicon" refers to an RNA molecule that is capable of directing its own 

amplification or self-replication in vivo, within a target cell. To direct its own amplification, the 
RNA molecule should encode the polymerase(s) necessary to catalyze RNA amplification (e.g. , 
alphavirus nonstructural proteins nsPl, nsP2, nsP3, nsP4) and also contain cis RNA sequences 
required for replication which are recognized and utilized by the encoded polymerase(s). An 
5 alphavirus RNA vector replicon should contain the following ordered elements: 5 % viral or 
cellular sequences required for nonstructural protein-mediated amplification (may also be 
referred to as 5* CSE, or 5' cis replication sequence, or 5 ! viral sequences required in cis for 
replication, or 5' sequence which is capable of initiating transcription of an alphavirus), 
sequences which, when expressed, code for biologically active alphavirus nonstructural proteins 
3 (e.g., nsPl, nsP2, nsP3, nsP4), and 3' viral or cellular sequences required for nonstructural 

protein-mediated amplification (may also be referred as 3' CSE, or 3' viral sequences required in 
cis for replication, or an alphavirus RNA polymerase recognition sequence). The alphavirus 
RNA vector replicon also should contain a means to express one or more heterologous 
sequenced), such as for example, an IRES or a viral (e.g., alphaviral) subgenomic promoter 
> (e.g. y junction region promoter) which may, in certain embodiments, be modified in order to 
increase or reduce viral transcription of the subgenomic fragment, or to decrease homology with 
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defective helper or structural protein expression cassettes, and one or more heterologous 
sequence(s) to be expressed Preferably the heterologous sequence(s) comprises a protein- 
encoding gene, which is the 3* proximal gene within the vector replicon. And preferably the 
replicon further comprises a polyadenylate tract 
5 As discussed herein, "recombinant Alphavirus Particle", "alphavirus replicon particle" and 

"replicon particle" refers to a virion-like unit containing an alphavirus RNA vector replicon. 
Generally, the recombinant alphavirus particle comprises one or more alphavirus structural 
proteins, a lipid envelope and an RNA vector replicon. Preferably, the recombinant alphavirus 
particle contains a nucleocapsid structure that is contained within a host cell-derived lipid 

10 bilayer, such as a plasma membrane, in which one or more alphaviral envelope glycoproteins 
E2, El) are embedded Hie particle may also contain other components (e.g. , targeting 
elements such as biotin, other viral structural proteins or portions thereof, hybrid envelopes, or 
other receptor binding ligands), which direct the tropism of the particle from which the 
alphavirus was derived Generally the interaction between alphavirus RNA and structural 

15 protein(s) necessary to efficiently farm a replicon particle or nucleocapsid may be an RNA- 
protein interaction between a capsid protein and a packaging signal or packaging sequence 
contained within the RNA. 

"Alphavirus packaging cell line" refers to a cell which contains one or more alphavirus 
structural protein expression cassettes and which produces recombinant alphavirus particles 

10 (replicon particles) after introduction of an alphavirus RNA vector replicon, eukaryotic layered 
vector initiation system, or recombinant alphavirus particle. Hie parental cell may be of 
mammalian or non-mammalian origin. Within preferred embodiments, the packaging cell line is 
stably transformed with the structural protein expression cassette(s). 

'Defective helper RNA" refers to an RNA molecule that is capable of being amplified and 

15 expressing one or more alphavirus structural proteins within a eukaryotic cell, when that cell also 
contains functional alphavirus nonstructural "replicase" proteins. The alphavirus nonstructural 
proteins may be expressed within the cell by an alphavirus RNA replicon vector or other means. 
To permit amplification and structural protein expression, mediated by alphavirus nonstructural 
proteins, the defective helper RNA molecule should contain S'-end and 3'-end RNA sequences 

0 required for amplification, which are recognized and utilized by the nonstructural proteins, as 
well as a means to express one or more alphavirus structural proteins. Thus, an alphavirus 
defective helper RNA should contain the following ordered elements: 5* viral or cellular 
sequences required for RNA amplification by alphavirus nonstructural proteins (also referred to 
elsewhere as 5* CSE, or 5* cis replication sequence, or 5' viral sequences required in cis for 

5 replication, or 5* sequence which is capable of initiating transcription of an alphavirus), a means 
to express one or more alphavirus structural proteins, gene sequence(s) which, when expressed, 
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codes for one or more alphavirus structural proteins (e.g., C, E2, El), 3' viral or cellular 
sequences required for amplification by alphavirus nonstructural proteins (also referred to as 3' 
CSE, or 3' viral sequences required in cis for replication, or an alphavirus RNA polymerase 
recognition sequence), and a preferably a polyadenylate tract Generally, the defective helper 
RNA should not itself encode or express in their entirety all four alphavirus nonstructural 
proteins (nsPl, nsP2, nsP3, nsP4), but may encode or express a subset of these proteins or 
portions thereof, or contain sequence(s) derived from one or more nonstructural protein genes, 
but which by the nature of their inclusion in the defective helper do not express nonstructural 
protein(s) or portions thereof. As a means to express alphavirus structural protein(s), the 
defective helper RNA may contain a viral (e.g., alphaviral) subgenomic promoter which may, in 
certain embodiments, be modified to modulate transcription of the subgenomic fragment, or to 
decrease homology with replicon RNA, or alternatively some other means to effect expression of 
the alphavirus structural protein (e.g,, internal ribosome entry site, ribosomal readthrough 
element). Preferably an alphavirus structural protein gene is the 3' proximal gene within the 
defective helper; In addition, it is also preferable that the defective helper RNA does not contain 
sequences that facilitate RNA-protein interactions with alphavirus structural protein(s) and 
packaging into nucleocapsids, virion-like particles or alphavirus replicon particles. A defective 
helper RNA is one specific embodiment of an alphavirus structural protein expression cassette. 

Alphavirus for use in the invention may be grown in any one of the cell lines discussed 
above as suitable for the SARS virus. 

Alphavirus replicon particles may be produced according to the present invention by using 
the above cell lines (e.g., immortalized cell lines) and a variety of published and accepted 
alphavirus vector methodologies. Such methodologies include, for example, transient packaging 
approaches, such as the co-transfection of in vitro transcribed replicon and defective helper 
RNA(s) (Liljestrom, Bio/Technology 9:1356-1361, 1991; Bredenbeek et al, J. Virol 67:6439- 
6446, 1993; Prolov et al, J. Virol 71 :2819-2829, 1997; Pushko et at, Virology 259:389^101, 
1997; US Patents 5,789,245 and 5,842,723) or co-transfection of plasmid DNA-based replicon 
and defective helper constructs) (Dubensky et aL, J. Virol 70:508-519, 1996), as well as 
introduction of alphavirus structural protein expression cassettes (e.g., DNA-based defective 
helper) into immortalized cell lines of the present invention to create stable packaging cell lines 
(PCL) (Polo et at. PNAS 96:4598^603, 1999; US Patents 5,789,245, 5,842,723, 6,015,694; WO 
97/38087, WO 99/18226, WO 00/61772, and WO 00/39318). Stable packaging cell lines may 
then be utilized for alphavirus replicon particle production. The PCL may be transfected with in 
vitro transcribed alphavirus replicon RNA, transfected with a plasmid DNA-based replicon (e.g., 
ELVIS vector), or infected with a seed stock of alphavirus replicon particles, and then incubated 
under conditions and for a time sufficient to produce progeny alphavirus replicon particles in the 
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culture supernatant In addition, progeny replicon particles can subsequently be passaged in 
additional cultures of naive PCL by infection, resulting in further expansion and commercial 
scale preparations. Importantly, by using defective helper RNA or stable PCL based on the 
"split" structural gene configuration, these replicon particle stocks may be produced free from 
5 detectable contaminating RCV. 

Following harvest, crude culture supematants containing the chimeric alphavirus replicon 
particles may be clarified by passing the harvest through a filter (e.g., 0.2 uM, 0.45 uM, 0.65 uM, 
0.8 uM pore size). Optionally, the crude supematants may be subjected to low speed 
centrifugation prior to filtration to remove large cell debris. Within one embodiment, an 
10 endonuclease (e.g., Benzonase, Sigma #E8263) is added to the preparation of alphavirus replicon 
particles before or after a chromatographic purification step to digest exogenous nucleic acid. 
Further, the preparation may be concentrated prior to purification using one of any widely known 
methods (eg., tangential flow filtration). Crude or clarified alphavirus replicon particles may be 
concentrated and purified by chromatographic techniques (e.g. , ion exchange chromatography, 
5 size exclusion chromatography, hydrophobic interaction chromatography, affinity 

chromatography), such as those described in WO01/92552, incorporated by reference in its 
entirety herein. Two or more such purification methods may be performed sequentially. 

EXAMPLE OF ALPHAVIRUS REPLICON PARTICLES ENCODING SARS VIRUS SPIKE (S) 
ANTIGEN 

0 The invention includes compositions and methods for the production of replication 

defective viral vector particles (e.g., alphavirus replicon particles) for use in the ex vivo and in 
vivo a<hninistration of heterologous genes encoding proteins having therapeutic or prophylactic 
application, including genes encoding for one or more SARS viral antigens. 

The following example illustrates a method of preparing alphavirus replicon particles 
5 encoding SARS virus spike (s) antigen. 

The SARS virus spike gene can be incorporated into alphavirus replicon particles derived 
from a variety of alphavirus, such as Sindbis virus, Semliki Forest virus (US 5739026), 
Venezuelan equine encephalitis virus (US 6531135), and replicon particle chimeras derived from 
more than one alphavirus (US 6376236, WO 02/99035). In addition, the SARS virus spike gene 
) can be incorporated in its entirety (encoding full-length spike protein) or in a modified form that 
includes, for example, sequence deletions or truncations, such that the encoded a spike protein is 
of less than full-length (e.g., C-terminal truncation of one or more (e.g. at least 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 15, 20, 25, 30 etc.) amino acids, deleted of transmembrane region and cytoplasmic tail). 
For example, the spike gene may be cloned as a full-length gene into the VCR-chim2.1 
• vector (WO 02/99035) by standard RT-PCR conditions or by standard subcloning from one of 
the other plasmids described herein, using commercially available restriction endonucleases. For 
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the reverse transcription step in standard RT-PCR, the Superscript pre-amplification kit 
(Invitrogen™) and the primer SEQ ID NO: 7325 (sp-RT-R) are used: 

For the amplification step, the cDNA polymerase advantage kit (Clonetech) and two 
primers Sp-F-BbvCT (SEQ ID NO: 7326) and Sp-R-NotI (SEQ ID NO: 7327) are used: 

The forward primer is designed to contain the ccacc sequence (Kozak, 1991 JBC 19867- 
70) in front of the ATG codon to optimize translation efficiency of me spike gene. Also, the 
forward primer contains the BbvCI restriction site and the reverse primer contains the Noil 
restriction site for subsequent cloning of the PCR amplified gene. 

The PCR product is purified using the QIAquick Nucleotide Removal kit (QIAgen), 
digested with BbvCI and NotI, gel purified with QIAquick Gel Extraction kit (QIAgen), and 
ligated to plasmid VCR-Chim2.1 pre-digested with the same enzymes. Clones containing the 
SARS spike sequence are verified by sequencing and the new construct is called VCR-Chim2.1- 
SARSspike. 

To generate VBErep/SINenv-SARSspike replicon particles the plasmids VCR-Cbim2.1- 
SARSspike, VCR-DH-Scap (WO 02/99035), and VCR-DH-Sglydll60 (WO 02/99035) are 
linearized with the restriction enzyme Pmel and used for in vitro transcription as described 
previously (Polo et aL 1999, PNAS 96: 4598-603; WO02/99035). The transcripts are co- 
transfected into BHK cells as previously described (Polo et aL, 1999, ibid ; WO02/99035). The 
transfected cells are incubated at 34 °C, the supernatants collected at 20 and 30 hrs post- 
electroporation, clarified by centrifugation, and purified by chromatography as previously 
described (WO 01/92552). 

, Expression of the SARS spike protein from the replicon particle vector is verified by 
infecting BHK cells overnight with purified VEErep/SINenv-SARSspike or VEErep/SINenv- 
GFP (WO 02/99035) replicon particles. In addition, BHK cells also were transfected in parallel 
with in vitro transcribed VCR-Chim2.1-SARSspike rephcon RNA. At 16 hrs post-infection and 
transfection cells are lysed and a sample of the lysate analyzed by western blot using an antibody 
that recognizes SARS virus spike protein. The proteins on the gel are stained or transferred to a 
membrane for Western blot analysis with sera from convalescent patients or alternatively murine 
or rabbit antisera generated against SARS virus. VEErep/SINenv-SARSspike rephcon particles 
are administered to the vaccine recipient (e.g., rodent, non-human primate, human) as described 
elsewhere in the present invention. 

Figure 67 shows data from western blot analysis performed under non-reducing conditions, 
using a SARS virus specific rabbit polyclonal antisera. The western data demonstrate that not 
only is SARS spike protein expressed in cells infected with alphavirus rephcon particles or 
transfected with rephcon RNA but the predominant form of spike is that of a homotrimer 
(Rg.67A). Similar homotrimeric association of the spike protein was observed in western blots 
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of SARS virions purified from SARS vims infected VERO cell supernatants, and this 
homotrimer is heat labile, as indicated by the dissociation into monomelic forms at 80°C and 
100°C (Hg.67B). 

To further characterize SARS Spike protein expression and processing following 
5 expression from alphavirus replicon vectors, BHK-21 cells were infected with alphavirus 
replicon particles expressing the full-length Spike. At 6 hr post-infection with an MOI of 5, 
infected cells were labeled for 1 hr with I^[ 35 S]methionine/cysteine and chased for the indicated 
time. The [^S] -labeled spike protein was immunoprecipitated by anti-SARS rabbit serum and 
digested with Endo~EL Both digested and undigested proteins were analysed by 4% 
10 polyacrylamide-SDS PAGE under reducing conditions. As shown in Figure 55, the full-length 
spike protein is synthesized as an Endo-H sensitive high mannose glycoprotein (gpl70, an ER 
form) that undergoes modification to an Endo-H resistant glycoprotein with complex 
oligosaccharides (gpl80, a Golgi form). The conversion of gpl70 into the gpl80 form takes 
place within 2 hr. 

15 Alphavirus replicon particles expressing one or more SARS proteins (e.g., 

VEErep/SINenv-SARSspike replicon particles) are administered to the vaccine recipient in order 
to induce a SARS specific immune response (eg., rodent, ferret, non-human primate, human) as 
described elsewhere in the present invention. Immunization may be performed through a variety 
of routes, including for example, intramuscular, subcutaneous, intradermal, and intranasal. In 

20 additon, the alphavirus replicon particles may be used alone or in combination (e.g., "prime- 
boost") with other vaccine approaches of the present invention, or alternatively the alphavirus 
replicon particles may co-express antigen from other respiratory pathogens or be co-administered 
in combination with alphavirus replicon particles expressing antigens from other respiratory 
pathogens (e.g:, influenza virus, parainfluenza virus, respiratory syncytial virus, human 

J5 metapneumovinis). For example, the induction of anti-spike protein antibodies in animals 
immunized IM with VEErep/SINenv-SARSspike replicon particles was demonstrated in mice 
(Figure 68). These mouse studies also included addtional vaccine groups for comparison, 
including the inactivated SARS virus and recombinant truncated spike protein vaccines describe 
elsewhere herein, as well as plasmid DNA used as a prime, followed by alphavirus replicon 

50 particles as a boost The data clearly show very potent immune responses for all vaccine groups, 
including the alphavirus replicon particle group. It should be noted that the level of antibody 
induced by the inactivated SARS virus vaccine used in these experiments has been shown to be 
protective in a SARS virus animal challenge model. 

Similarly, genes encoding other SARS virus antigens (e.g., nucleocapsid protein, 

15 membrane glycoprotein) are cloned into alphavirus replicon vectors, either individually or in 
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combination, to generate alphavirus replicon particles according to the teachings of the present 
invention and using standard molecular biology techniques.. 

EXAMPLE OF ALPHA V7J? U S-BASED PLASMID DNA EXPRESSING SARS VIRUS SPIKE 

(S) 

The invention includes preparation of plasmid DNA expressing a SARS virus antigen for 
prophylactic or therapeutic immunization against SARS virus infection. In one embodiment, the 
SARS viral antigen is a spike (S) protein. In one embodiment, the plasmid DNA is alphavirus- 
based 

The following example illustrates one method for preparing an alphavirus-based plasmid 
DNA expressing SARS virus spike (S). 

SARS spike gene can be delivered using any of the alphavirus-based plasmid DNA 
replicons such as ELVS (Dubensky et al, 1996 J Virol. 70: 508-19), SINCP (WO 01/81609), or 
VCP (PCT WO 02/99035). 

For example, the SARS spike gene is cloned into SINCP using the standard RT-PCR 
techniques. The oligo Sp-RT-R is used for the reverse transcription step with the Superscript pre- 
amplification kit (Invitrogen). For the amplification step, the cDNA polymerase advantage kit 
(Clonetech) with the Sp-R-NotI and Sp-F-Xhol (SEQ ID NO: 7328) primers is used. - 

The Sp-F-Xhol primer was designed to contain the ccacc sequence in front of the ATG 
codon to optimize traiislauoneffiri Also, the primer 

contains the Xhol restriction site for the subsequent cloning of the PCR amplified gene. 

The PCR product is purified using the QIAquick Nucleotide removal kit, digested with 
Xhol and NotI, gel purified with QIAquick Gel Extraction kit, and ligated to plasmid SINCP pre- 
digested with the same enzymes. Clones containing the SARS spike sequence are verified by 
sequencing and the new construct is called SINCP-SARSspike. 

Expression of the SARS spike gene is verified by transient transfection of BHK cells with 
2ug of either plasmid DNA SINCP-SARSspike or SINCP pre-incubated for 5 minutes with 5 jd 
of Transrr Polyamine reagent (Minus) in low serum medium Optimem (life Technologies). At 
48 hrs pos-transfection cells are lysed and a sample of the lysate is run on 8% SDS-PAGE. The 
proteins on the gel are either stained or transferred to a membrane for Western blot analysis with 
sera from convalescent patients, or alternatively with sera from mouse or rabbits. 

SINCP-SARSspike plasmid replicon is administered to the vaccine recipient (e.g., rodent, 
non-human primate, human) as a formulated or unformulated plasmid vaccine, alone or in 
combination (e.g., "prime-boost") with other vaccines of the present invention, as described 
elsewhere herein. 

Similarly, genes encoding other SARS virus antigens (e.g., nucleocapsid protein, 
membrane glycoprotein) are cloned into alphavirus plasmid replicon vectors. 
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2. Plasmid Expression Vectors 

EXAMPLE OF PLASMID DNA EXPRESSING SARS VIRUS SPIKE (S) 

Hie following example illustrates a method for preparing plasmid DNA expressing SARS 
virus spike (s). 

5 The SARS virus spike antigen also may be delivered using other plasmid DNA expression 

vectors (sometimes referred to as "conventional" DNA vaccines), based on a polymerase II 
promoter, such as, for example, a CMV promoter. A DNA vaccine of the spike antigen gene 
induces an antibody response in mice (Zhao et aL (2004) Acta Biochim et Biophysica Sinica 
36:37^11), and has been found to induce viral neutralization and protective immunity in mice 

10 (Yang et aL (2004) Nature 428:561-564), particularly when truncated at the C-terminus. 

For example, the SARS spike gene is cloned into pCMVKm2 (Zur Megede et al. 9 J. Virol., 
74:2628-2635, 2000; SEQ ID NO: 9923) using standard RT-PCR techniques. The oligo Sp-RT- 
R is used for the reverse transcription step with the Superscript pre-amplification kit 
Otavitrogen). For the amplification step, the cDNA polymerase advantage kit (Clonetech) is used 

15 with primers Sp-E-EcoRI (SEQ ID NO: 7329) and Sp-R-Xbal (SEQ ID NO: 7330). . . 

The forward primer was designed to contain the CCACC (SEQ ID NO: 733 1) sequence in 
front of the ATG codon to optimize translation efficiency (Kozak 1991, ibid.) of the spike gene. 
Also, the forward primer contains the EcoRI restriction site and the reverse primer contains the 
Xbal restriction site for the subsequent cloning of the PCR amplified gene. 

10 The PCR product is purified using the QIAquick Nucleotide Removal kit, digested with 

Xhol and NotI, gel purified with QIAquick Gel Extraction kit, and ligated to plasmid .-: 
pCMYKm2 pre-digested with the same enzymes. Clones containing the SARS spike sequence 
are verified by sequencing and the new construct is called pCMVKm2-SARSspike. 

Expression of the SARS spike gene is verified by transient transfection of BHK or 293 

\5 cells with 2\ig of either plasmid DNA pCMVKm2-S ARSspike or pCMVKm2 pre-incubated for 
5 minutes with 5 pi of TransIT Polyamine reagent (Minus) in low serum medium Optimem (life 
Technologies). At 48 hrs pos-transfection cells are lysed and a sample of the lysate is run on 8 % 
SDS-PAGE. Hie proteins on the gel are either stained or transferred to a membrane for Western 
blot analysis with sera from convalescent patients, or alternatively using mouse or rabbit 

0 antisera. 

Plasmid pCMVKm2-S ARSspike is administered to the vaccine recipient (e.g., rodent, non- 
human primate, human) as a formulated or unformulated plasmid vaccine, as described 
elsewhere in the present invention. 

Similarly, genes encoding other SARS virus antigens (e.g., nucleocapsid protein, 
5 membrane glycoprotein) are cloned into plasmid expression vectors 
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3. Virus-like Particles comp risin g SARS antig ens 

The SARS viral antigens of the invention may be formulated into Virus like Particles 
("VLPs"). The invention thus includes virus-like particles (or VLPs) comprising one or more 
SARS viral antigens. Preferably, the VLPs comprise one or more SARS viral antigens selected 
from the group consisting of Spike (S), nucleocapsid (N), membrane (M) and envelope (E). 
Preferably, the VLPs comprise at least M and E 

The VLPs of the invention comprise at least one particle-forming polypeptide. Said 
particle-forming polypeptide is preferably selected from a Coronavirus structural protein. In one 
embodiment, the particle-forming polypeptide is selected from one or more SARS viral antigens. 
In another embodiment, the particle-forming polypeptide is selected from the structural protein 
of a non-SARS Coronavirus, such as, for example, Mouse Hepatitis Virus. 

VLPs can be formed when viral structural proteins are expressed in eukaryotic or 
prokaryotic expression systems. Upon expression, the structural proteins self-assemble to form 
particles. Alternatively, viral structural proteins may be isolated from whole virus and • 
formulated with phospholipids. Such viral structural proteins are referred to herein as "particle- 
forming polypeptides". VLPs are not infectious because no viral genome is present, however, 
these non-replicating, virus capsids mimic the structure of native virions. 

Due to their structure, VLPs can display a large number of antigenic sites on their surface 
(similar to a native virus). VLPs offer an advantage to live or attenuated vaccines in that they are 
much safer to both produce and administer, since they are not infectious. VLPs have been 
shown to induce both neutralizing antibodies as well as T-cell responses and can be presented by 
both class I and n MHC pathways. 

Previous work creating VLPs from coronavirus indicates that E and M proteins along may 
be sufficient for coronavirus VLP formation. See Hscher et aL, J. Virol. (1998) 72:7885-7894 
and Vennema et al. EMBO J. (1996) 15:2020-2028. 

Chimeric VLPs comprising particle-forming polypeptides or portions thereof from non- 
SARS Coronaviruses are also included in the invention. Such particle-forming polypeptides may 
comprise a full length polypeptide from a non-SARS Coronavirus. Alternatively, a particle- 
forming fragment may be used 

In one embodiment, a fragment of a non-SARS particle-forming polypeptide and a 
fragment of a SARS viral antigen are fused together. For instance, such chimeric polypeptides 
may comprise the the endodomain and transmembrane domain of a non-SARS particle-forming 
polypeptide and the ectodomain of a SARS viral antigen. In one example, the VLPs of the 
invention comprise a chimeric spike protein comprising an endodomain and transmembrane 
domain of the spike protein of Mouse Hepatitis Virus (MHV) and the chimeric spike protein 
further comprises the ectodomain of the SARS spike protein. Such VLPs may further comprise 
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Coronavirus M and E proteins. Said M and E proteins may be selected from any coronavirus, 
including Mouse Hepatitis Virus (MHV) or SARS. Sample sequences of S, M and E proteins of 
MHV are included in the figures, supra. 

Chimeric spike proteins derived from the ectodomain of feline infectious peritonitis virus 
5 (HPV) spike protein fused to the endo and transmembrane domains of MHV spike protein have 
been previously disclosed. See WO 98/49195 and WO 02/092827. In these chimeric VLP 
structures, the capsid structure of the VLPs is formed by the M and E protein of MHV. The 
chimeric spike protein provides for the surface exposure of the ectodomain of the MPV spike 
protein. 

0 As used herein, the term "virus-like particle" or "VLP" refers to a non-rephcating, empty 

virus shell. VLPs are generally composed of one or more viral proteins, such as, but not limited 
to those proteins referred to as capsid, coat, shell, surface and/or envelope proteins, or particle- 
forming polypeptides derived from these proteins. VLPs can form spontaneously upon 
recombinant expression of the protein in an approrpirate expression system. Alternatively, viral 
5 structural proteins may be isolated from whole virus and formulated with phospholipids. 
Methods for producing particular VLPs are known in the art and discussed more fully below. 
The presence of VLPs in a composition can be detected using conventional techniques known in 
the art, such as by electron microscopy, x-ray crystallography, and the like. See, e.g., Baker et 
aL, Biophys. J. (1991) 60:1445-1456; Hagensee et aL, J. Virol (1994) 68:4503^505. For 
example, cryoelectron microscopy can be performed on vitrified aqueous samples of the VLP 
preparation in question, and images recorded under appropriate exposure conditions. 

The phrase "particle-forming polypeptide" includes a full-length or near full-length viral 
protein, as well as a fragment thereof, or a viral protein with internal deletion, which has the 
ability to form VLPs under conditions that favor VLP formation. Accordingly, the polypeptide 
may comprise the full-length sequence, fragments, truncated and partial sequences, as well as 
analogs and precursor forms of the reference molecule. The term therefore includes deletions, 
additions and substitutions to the sequence, so long as the polypeptide retains the ability to form 
a VLP. Thus, the term includes natural variations of the specified polypeptide since variations in 
coat proteins often occur between viral isolates. The term also includes deletions, addition and 
substitutions that do not naturally occur in the reference protein, so long as the protein retains the 
ability to form a VLP. 

Preferred substitutions are those which are conservative in nature, Le., those substitutions 
that take place within a family of amino acids that are related in their side chains. Specifically, 
amino acids are generaUy divided into four families: (1) acidic: aspartate and glutamate; (2) 
basic: lysine, arginine, and bistidine; (3) non-polar alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan; and (4) uncharged polar glycine, asparagine, glutamine, 
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cystine, serine, theronine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes 
classified as aromatic amino acids. For example, it is reasonably predictable that an isolated 
replacement of leucine with isoleucine or valine, an asparate with a glutamate, a threonine with a 
serine, or a similar conservative replacement of an amino acid with a structurally related amino 

5 acid, will not have a major effect on the biological activity. Proteins having substantially the 
same amino acid sequence as the reference molecule, but possessing minor amino acid 
substitutions that do not substantially affect the immunogenicity of the protein, are therefore 
within the definition of the reference polypeptide. 

The VLPs of the invention can be formed from any viral protein, particle-forming 

10 polypeptide derived from the viral protein, or combination of viral proteins or fragments thereof, 
that have the capability of forming particles under appropriate conditions. The requirements for 
the particle-forming viral proteins are that if the particle is formed in the cytoplasm of die host 
cell, the protein must be sufficiently stable in the host cell in which it is expressed such that 
formation of virus-like structures will result, and that the polypeptide will automatically 

15 assemble into a virus-like structure in the cell of the recombinant expression system used. If the 
protein is secreted into culture media, conditions can be adjusted such that VLPs will form. 
Furthermore, the particle-forming protein should not be cytotoxic in the expression host and 
should not be able to replicate in the host in which the VLP will be used. 

Preferred particle-forming polypeptides include coronavirus M and E proteins, preferably 

10 SARS M and E proteins. 

Methods and suitable conditions for forming particles from a wide variety of viral proteins 
are known in the art. VLPs have been produced, for example from proteins derived from 
influenza virus (such as HA or NA), Hepatitis B virus (such as core or capsid proteins), Hepatitis 
E virus, measles virus, Sindbis virus, Rotavirus, Foot-and-Mouth Disease virus, Retrovirus, 

15 Norwalk virus, human Papilloma virus, HIV, RNA-phages, QB-phage (such as coat proteins), 
GA-phage, fr-phage, AP205 phage, and Ty (such as retrotransposon Ty protein pi). VLPs are 
discussed further in WO 03/024480, WO 03/024481, and Niikura et al., Virology (2002) 
293:273-280; Lenz etaL 9 J. Immunology (2001) 5246-5355; Pinto, et al. 9 /. Infectious Diseases 
(2003) 188:327-338; and Gerber et aL t J. Virology (2001) 75(10):4752^760. 

10 As explained above, VLPs can spontaneously form when the particle-forming polypeptide 

of interest is recombinantly expressed in an appropriate host cell. Thus, the VLPs for use in the 
present invention may be prepared using recombinant techniques, well known in the art In this 
regard, genes encoding the particle-forming polypeptide in question can be isolated from DNA 
libraries or directly from cells and tissues containing the same, using known techniques. The 

»5 genes encoding the particle-forming polypeptides can also be produced synthetically, based on 
the known sequences. The nucleotide sequence can be designed with the appropriate codons for 
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the particular amino sequence desired In general, one will select preferred codons for the 
intended host in which the sequence will be expressed (e.g. human codons for human DNA 
vaccines). Hie complete sequence is generally assembled from overlapping oligonucleotides 
prepared by standard methods and assembled into a complete coding sequence. See., e.g., Edge, 
5 Nature (1981) 292:756; NambairefaL Science (1984) 223:1299; Jay etaL, J. Biol Chem.(\9U) 
259:6311. 

Once the coding sequences for the desired particle-forming polypeptides have been 
isolated or synthesized, they can be cloned into any suitable vector or replicon for expression. 
Numerous cloning vectors are known to those of skill in the ait, and the selection of an 

10 appropriate cloning vector is a matter of choice. See, generally, Sambrook et al. The vector is 
then used to transform an appropriate host cell. Suitable expression systems include, but are not 
limited to, bacterial, mammalian, bacuolvirus/insect, vaccinia, Semliki Forest virus (SFV), yeast, 
and Xenopus expression systems, well known in the art 
* A number of cell lines suitable for use as host cells for producing the VLPs of the 

15 invention are known in the art Suitable mammalian cell lines include, but are not limited to, 
Chinese Hamster Ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) cells, monkey 
kidney cells (COS), human hepatocellular carcinoma cells (e.g. y Hep G2), Madin-Darby bovine 
kidney ("MDBK") cells, as well as others. Mammalian sources of cells include, but are not 
limited to, human or non-human primate (e.g., MRC-5 (ATCC CCH71), WI-38 (ATCC CCLr 

20 75), HUH, human embryonic kidney cells (293 cells, typically transformed by sheared 

adenovirus type 5 DNA), VERO cells from monkey kidneys (including, for example COS7 
cells), horse, cow (e.g., MDBK cells), sheep, dog (e.g., MDCK cells from dog kidneys, ATCC 
CCL34 MDCK (NBL2) or MDCK 33016, deposit number DSM ACC 2219 as described in WO 
97/37001), cat, and rodent (e.g., hamster cells such as BHK21-F, HKCC cells, or Chinese 

15 hamster ovary cells (CHO cells)), and may be obtained from a wide variety of developmental 
stages, including for example, adult, neonatal, fetal, and embryo. 

Bacterial hosts suitable for production of VLPs of the invention include E. coli, Bacillus 
subtilis, and Streptoccocus spp. Yeast hosts suitable for production of VLPs of the invention 
include Saccharomyces cerevisiae, Candida albicans, Candida maltosa, Hansemda polymorpha, 

JO Kluyveromycesfragilis, Kluyveromyces lactis, Pichia guillerimondii, Pichia pastoris, 

Schizosaccharomyces pombe and Yarrowia lipolytica. Insect cells suitable for production of 
VLPs of the invention (i.e. 7 via baculovirus expression vectors) include Aedes aegypti y 
Autographa calif brnica, Bombyx mori, Drosophila melanogaster, Spodptera frugiperda 1 and 
Trichoplusia ni. 

15 Viral vectors can be used for the production of particles in eukaryotic ceUs, such as those 

derived from the pox family of viruses, including vaccinia virus and avian poxvirus. Additional, 

-153- 



WO 2004/092360 



PCT/US2004/011710 



vaccinia based Mection/transfection systems, such as those as described in Tomei et aL, J. Virol 
(1993) 67:4017-4026 and Selby etaL.J.Gen. Virol. (1993) 74:1 103-11 13, can also be used to 
generate the VLPs of the invention. In this system, cells are first transfected in vitro with a 
vaccinia virus recombinant that encodes the bacteriophage T7 RNA polymerase. This 

5 polymerase only transcribes templates bearing T7 promoters. Following infection, cells are 
transfected with the DNA of interest, driven by a T7 promoter. The polymerase expressed in the 
cytoplasm from the vaccinia virus recombinant transcribes the transfected DNA into RNA which 
is then translated into protein by the host translation machinery. The method provides for high 
level, transient, cytoplasmic production of large quantities of RNA and its translation products. 

10 Depending on the expression system and host selected, the VLPs are produced by growing 

host cells transformed by an expression vector under conditions whereby the particle-forming 
polypeptide is expressed and VLPs can be formed. Hie selection of the appropriate growth 
conditions is within the skill of the art. If the VLPs are formed intracellularly, the cells are then 
disrupted, using chemical, physical or mechanical means, which lyse the cells yet keep the VLPs 

.5 substantially intact. Such methods are known the those of skill in the art and are described in,. 
e.g. 9 Protein Purification Applications: A Practical Approach, (EJLV. Harris and S. Angal, 
Eds., 1990). . 

The particles are then isolated using methods that preserve the integrity thereof, such as by . 
gradient centrifugation, e.g., cesium chloride (CsCl) and sucrose gradients, and the like (see, e.g., 

O Kirnbauer etaL,J. Virol (1993) 67:6929-6936), ion exchange chromatography (including anion 
exchange chromatography such as DMAE and TMAE), hydroxyapatitie chromatography (see 
WO 00/09671), hydrophobic interaction chromatography, gel filtration chromatography and 
other filtration methods such as nanometric filtration and ultrafiltration. Preferably at least one 
anion exchange step is performed during purification, and more preferably at least two anion 

5 exchange steps are used. 

VLP formulations of the invention may be further processed by methods known in the art 
to disassemble the VLPs into smaller, protein containing moieties using a high concentration of 
reducing agent, followed by reassembly of the VLPs by either removal of the reducing agent or 
by addition of excess oxidant The resulting reassembled VLPs may have improved 

0 homogeneity, stability and immunogenic properties. In addition, further therapeutic or 

prophylactic agents may be formulated into the VLPs upon reassembly. See McCarthy etaL 9 J. 
Virology (199S)12(l):32Ah See also WO 99/13056 and WO 01/42780. Reducing agents 
suitable for use in VLP disassembly include sulfhydryl reducing agents (such as glutathion, beta 
mercaptoethanol, dithiothreitol, dithioerythritol, cysteine, hydrogen sulfide and mixtures thereof) 

5 preferably contained in moderate to low ionic strength buffers. Sufficient exposure time of the 
VLPs to the reducing agent will be required to achieve a suitable amount of VLP disassembly. 
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Adjuvants may be added to the VLPs of the invention to enhance the immunogenicity of 
the S ARS viral antigens. Antigens suitable for use with VLPs include those described, supra. 
For example, the VLPs of the invention may be adsorbed onto an aluminum adjuvant 

The VLPs of the invention may formulated to enhance their stability. Additional 
components which may enhance the stability of a VLP formulation include salts, buffers, non- 
ionic surfactants and other stabilizers such as polymeric polyanion stabilizers. See WO 
00/45841. 

The ionic strength of a solution comprising VLP particles may be maintained by the 
presence of salts. Almost any salt which can contribute to the control of the ionic strength may 
be used. Preferred salts which can be used to adjust ionic strength include physiologically 
acceptable salts such as NaCl, KC1, Na 2 S0 4 , (NKO2SO4, sodium phosphate and sodium citrate. 
Preferably, the salt component is present in concentrations of from about 0.10 M to 1 M. Very 
high concentrations are not preferred due to the practical limitations of parenteral injection of 
high salt concentrations. Instead, more moderate salt concentrations, such as more physiological 
concentrations of about 0.15M to about 0.5M with 0.15M-0.32M NaCl are preferred 

Buffers may also be used to enhance the stability of the VLP formulations of the invention. 
Preferably, the buffer optimizes the VLP stability while maintaining the pH range so that the 
vaccine formulation will not be irritating to the recipient. Buffers preferably maintain the pH of. 
the vaccine formulation within a range of p/H 5.5-7.0, more preferably 6.0-6.5. Buffers suitable 
for vaccine formulations are known in the art and include, for example, histidine and imidazole. 
Preferably, the concentration of the buffer will range from about 2mM to about 100 mM, more . 
preferably 5 mM to about 20 mM. Phosphate containing buffers are generally not preferred 
when the VLP is adsorbed or otherwise formulated with an aluminum compound. 

Non-ionic surfactants may be used to enchance the stability of the VLP formulations of the 
invention. Surfactants suitable for use in vaccine formulations are known in the art and include, 
for example, polyoxyethylene sorbital fatty acid esters (Polysorbates) such as Polysorbate 80 
(e.g., TWEEN 80), Polysorbate 20 (e.g. y TWEEN 20), polyoxyethylene alkyl ethers (e.g., Brij 
35, Brij 58), as well as others, including Triton X-100, Triton X-114, NP-40, Span 85 and the 
Pluronic series of non-ionic surfactants (e.g., Pluronic 121). The surfactant is preferably present 
in a concentration of from about 0.0005% to about 0.5% (wt/vol). 

Polymeric polyanion stabilizers may also be used to enchance the stability of the VLP 
formulations of the invention. Suitable polymeric polyanionic stabilizers for use in the invention 
comprise either a single long chain or multiple cross linked chains; either type possessing 
multiple negative charges along the chains when in solution. Examples of suitable polyanionic 
polymers include proteins, polyanions, peptides and polynucelic acids. Specific examples 
include carboxymethyl cellulose, heparin, polyamino acids (such as poly(Glu), poly(Asp), and 
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Poly (CHu, Phe), oxidized glutathione, polynuceltodies, RNA, DNA and seium albumins. The 
concentration of the polmeric polyanion stabilizers is preferably from about 0.01% to about 
0.5%, particularly about 0.05-0.1% (by weight). 

G. Passive Immunization via Antibodies to the SARS Antigens of the Invention 
5 Hie invention includes antibodies specific to the SARS antigens of the invention and 

methods of treatment or prevention of SARS virus related disease by administrating an effective 
amount of SARS antibodies to a mammalian subject Antibodies specific the SARS antigens can 
be produced by one skilled in the art Preferably, the antibodies are specific to the spike (S) 
protein of the SARS virus. Potent neutralization of the SARS coronavirus using a human 
10 monoclonal anti-spike antibody has been reported (Sui et al. (2004) PNAS USA 101:2536-2541). 
A IgGl form of the monoclonal antibody showed a higher affinity (1.59 nM) than a scFv form 
(32.3 nM). 

Hie antibodies of the invention are specific and selective to SARS antigens. 
In one embodiment, the antibodies of the invention are generated by administering a SARS 
15 antigen to an animal. The method may also include isolating the antibodies from the animal. 

The antibodies of the invention may be polyclonal or monoclonal antibody preparations, 
monospecific antisera, human antibodies, or may be hybrid or chimeric antibodies, such as 
humanized antibodies, altered antibodies (Fab')2 fragments, F(ab) fragments, Fv fragments, 
single-domain antibodies, dimeric or trimeric antibody fragments or constructs, minibodies, or 
10 functional fragments thereof which bind to the antigen in question. 

Antibodies are produced using techniques well known to those of skill in the art and 
disclosed in, for example, US Patent Nos. 4,011,308; 4,722,890; 4,016,043; 3,876,504; 
3,770,380; and 4372,745. For example, polyclonal antibodies are generated by immunizing a 
suitable animal, such as a mouse, rat, rabbit, sheep, or goat, with an antigen of interest In order 
!5 to enhance immunogenicity, the antigen can be linked to a carrier prior to immunization. Such 
carriers are well known to those of ordinary skill in the art. Immunization is generally performed 
by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as Freund's 
complete adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously 
or intramuscularly). Hie animal is generally boosted 2-6 weeks later with one or more injections 
0 of the antigen in saline, preferably using Freund's incomplete adjuvant Antibodies may also be 
generated by in vitro immunization, using methods known in the art Polyclonal antiserum is 
then obtained from the immunized animal. 

Monoclonal antibodies are generally prepared using the method of Kohler & Milstein 
(1975) Nature 256:495-497. or a modification thereof. Typically, a mouse or rat is immunized as 
5 described above. Rabbits may also be used. However, rather than bleeding the animal to extract 
serum, the spleen (and optionally several large lymph nodes) is removed and dissociated into 
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single cells. If desired, the spleen cells may be screened (after removal of non-specifically 
adherent cells) by applying a cell suspension to a plate or weU coated with the antigen. B-cells, 
expressing membrane-bound immunoglobulin specific for the antigen, will bind to the plate, and 
are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen 
5 cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a 
selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, iC HAT")- The resulting 
hybridomas are plated by limiting dilution, and are assayed for the production of antibodies 
which bind specifically to the immunizing antigen (and which do not bind to unrelated antigens). 
The selected monoclonal antibody-secreting hybridomas are then cultured either in vitro (e.g., in 

10 tissue culture bottles or hollow fiber reactors), or in vivo (e.g. , as ascites in mice). 

Humanized and chimeric antibodies are also useful in the invention. Hybrid (chimeric) 
antibody molecules are generally discussed in Winter et aL (1991) Nature 349: 293-299 and US 
Patent No. 4,816,567. Humanized antibody molecules are generally discussed in Riechmann et 
aL (1988) Nature 332:323-327; Verhoeyan et al. (1988) Science 239: 1534-1536; and U1K. 

15 Patent Publication No. GB 2,276,169, published 21 September 1994). One approach to 

engineering a humanized antibody involves cloning recombinant DNA containing the promoter, 
leader, and variable-region sequences from a mouse antibody gene and the constant-region exons . 
from a human antibody gene to create a mouse-human chimera, a humanized antibody. See 
generally, Kuby, "Immunology, 3 rd Edition", W.H. Freeman and Company, New York (1998) at 

JO page 136. 

Antibody fragments which retain the ability to recognize a SARS antigen are also included . 
within the scope of the invention. A number of antibody fragments are known in the art which 
comprise antigen-binding sites capable of exhibiting immunological binding properties of an 
intact antibody molecule. For example, functional antibody fragments can be produced by 

15 cleaving a constant region, not responsible for antigen binding, from the antibody molecule, 
using e.g., pepsin, to produce F(ab , ) 2 fragments. These fragments will contain two antigen 
binding sites, but lack a portion of the constant region from each of the heavy chains. Similarly, 
if desired, Fab fragments, comprising a single antigen binding site, can be produced, e.g. 9 by 
digestion of polyclonal or monclonal antibodies with papain. Functional fragments, includnig 

50 only the variable regions of the heavy and light chains, can also be produced, using standard 
techniques such as recombinant production or preferential proteolytic cleavage of 
immunoglobulin molecules. These fragments are known as F v . See, e.g. y Inbar et aL (1972) 
Proc. Nat. Acad. Sci USA 69:2659-2662; Hochman et aL (1976) Biochem 15:2706-2710; and 
Ehrlich et aL (1980) Biochem 19:4091^096. 

»5 A single-chain Fv ("sFv" or scFv") polypeptide is a covalently linked V H -V L heterodimer 

which is expressed from a gene fusion including V H -and V L - encoding genes linked by a peptide- 
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encoding linker. Huston et aL (1988) Proc. Nat. Acad. ScL USA 85:5879-5883. A number of 
methods have been described to discern and develop chemical strucutres (linkers) for converting 
the naturally aggregated, but chemically separated, light and heavy polypeptide chains fiom an 
antibody V region into an sFv molecule which will fold into a three dimensional structure 
5 substantially similar to the structure of an antigen-binding site. See, e.g. 9 US Patent Nos. 
5,091,513; 5,132,405; and 4,946,778. The sFv molecules may be produced using methods 
described in thea it See, e.g., Huston et aL (1988) Proc. Nat. Acad. Sci USA 85:5879-5338; US 
Patent Nos. 5,091,513; 5,132,405 and 4,946,778. Design criteria include determining the 
appropriate length to span the distance between the C-terminus of one chain and the N-teiminus 
10 of the other, wherein the linker is generally formed from small hydrophilic amino acid residues 
that do not coil or form secondary structures. Such methods have been described in the art See, 
e.g. 7 US Patent Nos. 5,091,513; 5,132,405 and 4,946,778. Suitable linkers generally comprise 
polypeptide chains of alternating sets of glycine and serine residues, and may include glutamic 
acid and lysine residues inserted to enhance solubility. Anti-spike scPv antibodies have been 
5 reported (Sui etaL (2004) PNAS USA 101:2536-2541). 

"Mini-antibodies" or "minibodies" will also find use with the present invention. 
Minibodies are sFv polypeptide chains which include oligomerization domains at their C- 
tennini, separated from the sFv by a hinge region. Pack et aL, (1992) Biochem 31: 1579-1584. 
The oligomerization domain comprises self-associating a-helices, e.g., leucine zippers, that can 
0 be further stabilized by additional disulfide bonds. The oligomerization domain is designed to be 
compatible with vectorial folding across a membrane, a process thought to facilitate in vivo 
folding of the polypeptide into a functional binding protein. Generally, minibodies are produced 
using recombinant methods well known in the art See, e.g., Pack et aL, (1992) Biochem 
31:1579-1584; Cumber et al. (1992) /. Immunology 149B:120-126. 

> Non-conventional means can also be used to generate and identify the antibodies of the 
invention. For example, a phage display library can be screened for antibodies which bind to the 
SARS antigens of the invention. See generally, Siegel, "Recombinant Monoclonal Antibody 
Technology", Transfus. Clin. BioL (2002) 9(1): 15-22; Sidhu, "Phage Display in Pharmaceutical 
Biotechnology", Curr. Opin. BiotechnoL (2000) 11(6):610-616; Sharon, et al., Recombinant 

> Polyclonal Antibody libraries", Comb. Chem. High Throughput Screen (2000) 3(3): 185-196; 
and Schmitz et al. , "Phage Display: A Molecular Tool for the Generation of Antibodies- 
Review", Placenta, (2000) 21 SupplA: S106-12. 

The antibodies of the invention may also be generated by administering the polynucleotide 
sequence encoding for the SARS antigen into an animal. The SARS antigen is then expressed in 
vivo, and antibodies specific to the SARS antigen are generated in vivo. Methods for 
polynucleotide delivery of the SARS antigens of the invention are discussed in section 4 below. 
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Hie antibodies of the invention are preferably specific to the SARS vims. 

H. Combinatio ns of one or more of any of the above approaches in a vaccine 

Hie compositions of the invention further comprise combinations of one or more of the 
compositions discussed above. For instance, the invention comprises a composition comprising 
5 an attenuated SARS vims and a subunit SARS viral antigen. 

I. Combinations of SARS antieens and other Respiratory Virus Antigens 

The invention further relates to vaccine formulations comprising one or more SARS virus 
antigens and one or more other respiratory virus antigens. Additional respiratory virus antigens 
suitable for use in the invention include antigens from influenza virus, human rhinovirus (HRV), 
0 parainfluenza virus (PIV), respiratory syncytial virus (RSV), adenovirus, metapneumovirus, and 
rhinovirus. The additional respiratory virus antigen could also be from a coronavinis other than 
the SARS coronavinis, such as the NL63 human coronavinis (van der Hoek et al (2004) Nature 
Medicine 10:368-373). Preferably, the additional respiratory virus antigen is an influenza viral 
antigen. 

5 The invention may also comprise one or more bacterial or viral antigens in combination 

with the SARS viral antigen. Antigens may be used alone or in any combination. (See, e.g., 
WO 02/00249 describing the use of combinations of bacterial antigens). The combinations may 
include multiple antigens from the same pathogen, multiple antigens from different pathogens or 
multiple antigens from the same and from different pathogens. Thus, bacterial, viral, and/or 

) other antigens may be included in the same composition or may be administered to the same 
subject separately. It is generally preferred that combinations of antigens be used to raise an 
immune response be used in combinations. 

Non-limiting examples of bacterial pathogens which may be used in the invention include 
diphtheria (See, e.g., Chapter 3 of Vaccines, 1998, eds. Plbtkin & Mortimer (ISBN 0-7216-1946- 
0), staphylococcus (e.g., Staphylococcus aureus as described in Kuroda et aL (2001) Lancet 
357:1225-1240), cholera, tuberculosis, C. tetani, also known as tetanus (See, e.g., Chapter 4 of 
Vaccines, 1998, eds. Plotkin & Mortimer (ISBN 0-7216-1946-0), Group A and Group B 
streptococcus (including Streptococcus pneumoniae, Streptococcus agalactiae and Streptococcus 
pyogenes as described, for example, in Watson et al (2000) Pediatr. Infect. Dis. J. 19:331-332; 
Rubin et aL (2000) Pediatr Clin. North Am. 47:269-284; Jedrzejas et al. (2001) Microbiol Mol 
Biol Rev 65:187-207; Schuchat (1999) lam** 353:51-56; GB patent applications 0026333.5; 
0028727.6; 015640.7; Dale et al. (1999) Infect Dis Clin North Am 13:227-1243; Ferretti et aL 
(2001) PNAS USA 98:4658-4663), pertussis (See, e.g., Gusttafsson et al (1996) N. Engl. J. Med 
334:349-355; Rappuoli etal. (1991) JWTECH 9:232-238), meningitis, Moraxella catarrhalis 
(See, e.g., McMichael (2000) Vaccine 19 Suppl. 1:S 101-107) and other pathogenic states, 
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including, without limitation, Neisseria meningitides (A, B, C, Y), Neisseria gonorrhoeae (See, 
e.g., WO 99/24578; WO 99/36544; and WO 99/57280), Helicobacter pylori (e.g., CagA, VacA, 
NAP, HopX, HopY and/or urease as described, for example, WO 93/18150; WO 99/53310; WO 
98/04702) and Haemophilus influenza. Hemophilus influenza type B (EflQB) (See, e.g. y 

5 Costantino et aL (1999) Vaccine 17:1251-1263), Porphyromonas gingivalis (Ross et aL (2001) 
Vaccine 19:4135-4132) and combinations thereof. 

Non-limiting examples of viral pathogens which may be used in the invention include 
meningitis, riunoviius, influenza (Kawaoka et aL $ Virology (1990) 179:759-767; Webster et cd., 
"Antigenic variation among type A influenza viruses," p. 127-168. In: P. Palese and D.W. 

10 Kingsbury (ed), Genetics of influenza viruses. Springer-Verlag, New York), respiratory 
syncytial virus (RSV), parainfluenza virus (PIV), rotavirus (e.g., VP1, VP2, VP3, VP4, VP6, 
VP7, NSP1, NSP2, NSP3, NSP4 or NSP5 and other rotavirus antigens, for example as described 
in WO 00/26380) and the like. Antigens derived from other viruses will also find use in the 
present invention, such as without limitation, proteins from members of the families 

15 Picomaviridae (e.g., polioviruses, etc. as described, for example, in Sutter et aL (2000) Pediatr 
Clin North Am 47:287-308; Zimmerman & Spann (1999) Am Font Physician 59:113-118; 125- 
126); Caliciviridae; Togaviridae (e.g., rubella virus, etc.); Flaviviridae, including the genera 
flavivirus (e.g., yellow fever virus, Japanese encephalitis virus, serotypes of Dengue virus, tick 
borne encephalitis virus, West Nile virus, St. Louis encephalitis virus); pestivirus (e.g., classical 

iO porcine fever virus, bovine viral diarrhea virus, border disease virus); and hepacivirus (e.g., 
hepatitis A, B and C as described, for example, in US Patent Nos. 4,702,909; 5,01 1,915; 
5,698,390; 6,027,729; and 6,297,048); Parvovirus (e.g., parvovirus B19); Coronaviridae; 
Reoviridae; Bimaviridae; Rhabodoviridae (e.g., rabies virus, etc. as described for example in 
Dressen et aL (1997) Vaccine 15 Suppl:s2-6; MMWR Morb Mortal Wkly Rep. 1998 Jan 

>5 16:47(1): 12, 19); Hloviridae; Paramyxoviridae (e.g. , mumps virus, measles virus, respiratory 
syncytial virus, etc. as described in Chapters 9 to 11 of Vaccines, 1998, eds. Plotkin & Mortimer 
(ISBN 0-7216-1946-0); Orthomyxoviridae (e.g., influenza virus types A, B and C, etc. as 
described in Chapter 19 of Vaccines, 1998, eds. Plotkin & Mortimer (ISBN 0-7216-1946-0),.); 
Bunyaviridae; Arenaviridae; Retroviradae (e.g., HTLV-1; HTLV-11; HTV-1 (also known as 

10 HTLV-m, LAV, ARV, HTIJR, etc.)), including but not limited to antigens from the isolates 
HIVIllb, EHVSF2, HIVLAV, HIVI-AL, I-IIVMN, SF162); HIV- 1 CM235, HIV- 1 US4; HTV-2; 
simian immunodeficiency virus (SIV) among others. Additionally, antigens may also be derived 
from human papilloma virus (HPV) and the tick-borne encephalitis viruses. See, e.g. Virology, 
3 rd Edition (W.K. Joklik ed. 1988); Fundamental Virology, 2nd Edition (BJNT. Fields and D.M. 

15 Knipe, eds, 1991), for a description of these and other viruses. 
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Proteins may also be derived from the herpes vims family, including proteins derived from 
herpes simplex vims (HS V) types 1 and 2, such as HSV-1 and HSV-2 glycoproteins gB , gD and 
gH; antigens derived from varicella zoster virus (VZV), Epstein-Barr virus (EBV) and 
cytomegalovirus (CMV) including CMV gB and gH (See, US Patent No. 4,689,225 and PCT 

5 Publication WO 89/07143); and antigens derived from other human herpesviruses such as HHV6 
and HHV7. (See, e.g. Chee etaL, Cytomegaloviruses (J.K. McDougall, ed, Springer- Verlag 
1990) pp. 125-169, for a review of the protein coding content of cytomegalovirus; McGeoch et 
al, J. Gen. Virol (1988) 69:1531-1574, for a discussion of the various HSV-1 encoded proteins; 
US Patent No. 5,171,568 for a discussion of HSV-1 and HSV-2 gB and gD proteins and the 

10 genes encoding therefor, Baa: et a/., Nature (1984) 310:207-211, for the identification of protein 
coding sequences in an EBV genome; and Davison and Scott, /. Gen. Virol (1986) 67:1759- 
1816, for a review of VZV). Herpes simplex virus (HSV) rgD2 is a recombinant protein 
produced in genetically engineered Chinese hamster ovary cells. This protein has the normal 
anchor region truncated, resulting in a glycosylated protein secreted into tissue culture medium. 

15 The gD2 can be purified in the CHO medium to greater than 90% purity. Human 

immunodeficiency virus (HIV) env-2-3 is a recombinant form of the HIV enveloped protein 
produced in genetically engineered Saccharomyces cerevisae. This protein represents the entire • 
protein region of HTV gpl20 but is non-glycosylated and denatured as purified from the yeast. 
HIV gpl20 is a fully glycosylated, secreted form of gpl20 produced in CHO cells in a fashion 

10 similar to the gD2 above. Additional HSV antigens suitable for use in immunogenic 

compositions are described in PCT Publications W0 85/04587 and W0 88/02634, the disclosures 
of which are incorporated herein by reference in their entirety. Mixtures of gB and gD antigens, 
which are truncated surface antigens lacking the anchor regions, are particularly preferred. 

Antigens from the hepatitis family of viruses, including hepatitis A virus (HAV) {See, e.g., 

15 Bell etal. (2000) Pediatr Infect Dis. J. 19:1187-1188; Iwarson (1995)^*03 103:321-326), 
hepatitis B virus (HBV) (See, e.g., Gerlich et al (1990) Vaccine 8 Suppl:S63-68 & 79-80), 
hepatitis C virus (HCV) (See, e.g., PCT/US88/04125, published European application number 
318216), the delta hepatitis virus (HDV), hepatitis E virus (HBV) and hepatitis G virus (HGV), 
can also be conveniently used in the techniques described herein. By way of example, the viral 

0 genomic sequence of HCV is known, as are methods for obtaining the sequence. See, e.g. , 

International Publication Nos. WO 89/04669; WO 90/11089; and WO 90/14436. Also included 
in the invention are molecular variants of such polypeptides, for example as described in 
PCT/US99/31245; PCT/US99/31273 and PCT/US99/3 1272. The HCV genome encodes several 
viral proteins, including El (also known as E) and E2 (also known as E2/NSI) and an N-tenninal 
5 nucleocapsid protein (termed "core") (see, Houghton et a/., Hepatology (1991) 14:381-388, for a 
discussion of HCV proteins, including El and E2). Similarly, the sequence for the 8-antigen 
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from HDV is known (see, e.g., US Patent No. 5378,814) and this antigen can also be 
conveniently used in the present composition and methods. Additionally, antigens derived from 
HBV, such as the core antigen, the surface antigen, SAg, as well as the presurface sequences, 
pre-Sl and pre-S2 (formerly called pre-S), as well as combinations of the above, such as 
SAg/pre-Sl, SAg/pre-S2, SAgfpre-Sl/pre^S2, and pre-S l/pie-S2, will find use herein. See, e.g., 
"HBV Vaccines - from the laboratory to license: a case study" in Mackett, M. and Williamson, 
JX>., Human Vaccines and Vaccination, pp. 159-176, for a discussion of HBV structure; and US 
Patent Nos. 4,722,840, 5,098,704, 5,324,513, incorporated herein by reference in their entireties; 
Beames et aL, J. Virol (1995) 69:6833-6838, Birnbaum et aL, J. Virol (1990) 64:3319-3330; 
and Zhou et al r J. Virol (1991) 65:5457-5464. Each of these proteins, as well as antigenic 
fragments thereof, will find use in the present composition and methods. 

Influenza virus is another example of a virus for which the present invention will be 
particularly useful. Specifically, the envelope glycoproteins HA and NA of influenza A are of 
particular interest for generating an immune response. Numerous HA subtypes of influenza A 
have been identified (Kawaoka et al 9 Virology (1990) 179:759-767; Webster et aL, "Antigenic 
variation among type A influenza viruses," p. 127-168. In: P. Palese and D.W. Kingsbury (ed.), 
Genetics of influenza viruses. Springer-Verlag, New York). Thus, proteins derived from any of 
these isolates can also be used in the compositions and methods described herein. 

Non-limiting examples of parasitic antigens include those derived from organisms causing 
malaria and Lyme disease. 

Hie methods of the invention comprise administering an immunogenic composition 
comprising a SARS viral antigen (including one or more of an inactivated SARS virus, an . 
attenuated SARS virus, a split SARS virus preparation or a recombinant or purified subunit 
formulation of one or more SARS viral antigens) to an animal. Hie immunogenic' compositions 
used in the invention can comprise an immunologically effective amount of the SARS viral 
antigen. An "immunologically effective amount" is an amount sufficient to allow the mammal 
to raise an immune response to the SARS antigen. 

The immune response preferably involves the production of antibodies specific to the 
SARS antigen. Hie amount of antibodies produced will vary depending on several factors 
including the animal used, the presence of an adjuvant, etc. 

The immunogenic compositions of the invention may further comprise one or more 
adjuvants. 

The immunogenic compositions of the invention may be administered mucosally. Suitable 
routes of mucosal administration include oral, intranasal, intragastric, pulmonary, intestinal, 
rectal, ocular and vaginal routes. The immunogenic composition may be adapted for mucosal 
administration. For instance, where the composition is for oral administration, it may be in the 

-162- 



WO 2004/092360 



PCT/US2004/011710 



form of tablets or capsules, optionally enteric-coated, liquid, transgenic plants, etc. Where the 
composition is for intranasal administration, it may be in the form of a nasal spray, nasal drops, 
gel or powder. 

The immunogenic compositions of the invention may be administered parenterally. 
5 Suitable routes of parenteral administration include intramuscular (IM), subcutaneous, 

intravenous, intraperitoneal, intradermal, transcutaneous, and transdermal (see e.g., International 
patent application WO 98/20734) routes, as well as delivery to the interstitial space of a tissue. 
The immunogenic composition may be adapted for parenteral administration, for instance in the 
form of an injectable that may be sterile and pyrogen free. 
10 Vaccines of the invention may be administered in conjunction with other 

immunoiegulatory agents. In particular, compositions will usually include an adjuvant Preferred 
further adjuvants include, but are not limited to, one or more of the following set forth below: 

A. Mineral Containing Compositions 

Mineral containing compositions suitable for use as adjuvants in the invention include 
15 mineral salts, such as alumi n ium salts, and calcium salts. The invention includes mineral salts 

such as hydroxides (e.g. oxyhydroxides), phosphates (e.g. hydroxyphoshpates, orthophosphates), • 
sulphates, etc. (e.g. see chapters 8 & 9 of Vaccine design:the submit and adjuvant approach 
(1995) Powell & Newman. ISBN 0-306-44867-X.), or mixtures of different mineral compounds, 
with the compounds taking any suitable form (e.g. gel, crystalline, amorphous, etc.), and with 
K) adsorption being preferred. The mineral containing compositions may also be formulated as a 
particle of metal salt See WO00/23105. 

B. Oil-Emulsions 

Oil-emulsion compositions suitable for use as adjuvants in the invention include squalene- 
water emulsions, such as MF59 (5% Squalene, 0.5% Tween 80, and 0.5% Span 85, formulated 
5 into submicron particles using a microfluidizer). See WO90/14837. See also, Prey etaL, 
"Comparison of the safety, tolerability, and immunogenicity of a MF59-adjuvanted influenza 
vaccine and a non-adjuvanted influenza vaccine in non-elderly adults", Vaccine (2003) 21 :4234- 
4237. 

Particularly preferred adjuvants for use in the compositions are submicron oil-inwater 
0 emulsions. Preferred submicron oil-in-water emulsions for use herein are squalene/water 

emulsions optionally containing varying amounts of MTP-PE, such as a submicron oil-in-water 
emulsion containing 4-5% w/v squalene, 0.25-1.0% w/v Tween 80 ™ (polyoxyelthylenesorbitan 
monooleate), and/or 0.25-1.0% Span 85™ (sorbitan trioleate), and, optionally, N-acetylmuramyl- 
L-alanyl-D-isogluatminyl-I^alan^ 
5 huydroxyphosphophoryloxy>ethylamine (MTP-PE), for example, the submicron oil-in-water 
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emulsion known as "MF59" (International Publication No. WO 90/14837; US Patent Nos. 
6,299,884 and 6,451325, incorporated herein by reference in their entireties; and Ott et aL> 
"MF59 - Design and Evaluation of a Safe and Potent Adjuvant for Human Vaccines" in Vaccine 
Design: The Subunit and Adjuvant Approach (Powell, Mi 7 , and Newman, MJ. eds.) Plenum 

5 Press, New York, 1995, pp. 277-296). MF59 contains 4-5% w/v Squalene (e.g. 9 4.3%), 0.25- 
0.5% w/v Tween 80™, and 0.5% w/v Span 85™ and optionally contains various amounts of 
MTP-PE, formulated into submicron particles using a microfluidizer such as Model HOY 
microfluidizer (Microfluidics, Newton, MA). For example, MTP-PE may be present in an 
amount of about 0-500 jig/dose, more preferably 0-250 jig/dose and most preferably, 0-100 

10 fig/dose. As used herein, the term "MF59-0" refers to the above submicron oil-in-water 

emulsion lacking MTP-PE, while the tenn MF59-MTP denotes a formulation that contains MTP- 
PE. For instance, "MF59-100" contains 100 \ig MTP-PE per dose, and so on. MF69, another 
submicron oil-in-water emulsion for use herein, contains 4.3% w/v squalene, 0.25% w/v Tween 
80™, and 0.75% w/v Span 85™ and optionally MTP-PE. Yet another submicron oil-in-water 

5 emulsion is MF75, also known as SAF, containing 10% squalene, 0.4% Tween 80™, 5% 

pluronic-blocked polymer L121, and thr-MDP, also microfluidized into a submicron emulsion. 
MF75-MTP denotes an MF75 formulation that includes MTP, such as from 10O400 jig MTP-PE 
per dose. 

Submicron oil-in-water emulsions, methods of making the same and immunostimulating 
0 agents, such as muramyl peptides, for use in the compositions, are described in detail in 
International Publication No. WO 90114837 and US Patent Nos. 639,884 and 6,45 1,325, 
incorporated herein by reference in their entireties. 

Complete Freund's adjuvant (CPA) and incomplete Freund's adjuvant (IFA) may also be 
used as adjuvants in the invention. 
5 C. Saponin Formulations 

Saponin formulations, may also be used as adjuvants in the invention. Saponins are a 
heterologous group of sterol glycosides and triterpenoid glycosides that are found in the bark, 
leaves, stems, roots and even flowers of a wide range of plant species. Saponin fiom the bark of 
the QuiUaia saponaria Molina tree have been widely studied as adjuvants. Saponin can also be 
) commercially obtained from Smilax ornata (sarsaprilla), Gypsophilla paniculata (brides veil), 
and Saponaria officianalis (soap root). Saponin adjuvant formulations include purified 
formulations, such as QS21, as well as lipid formulations, such as ISCOMs. 

Saponin compositions have been purified using High Performance Thin Layer 
Chromatography (HP-LC) and Reversed Phase High Performance Liquid Chromatography (RP- 
! HPLC). Specific purified fractions using these techniques have been identified, including QS7, 
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QS17,QS18,QS21,QH-AQH-BandQH-C. Preferably, the saponin is QS21. A method of 
production of QS21 is disclosed in US Patent No. 5,057,540. Saponin formulations may also 
comprise a sterol, such as cholesterol (see WO 96/33739). 

Combinations of saponins and cholesterols can be used to form unique particles called 

5 Immunostimulating Complexs (ISCOMs). ISCOMs typically also include a phospholipid such 
as phosphatidylemanolamine or phosphatidylcholine. Any known saponin can be used in 
ISCOMs. Preferably, the ISCOM includes one or more of Quil A, QHA and QHC. ISCOMs are 
further described in EP 0 109 942, WO 96711711 and WO 96733739. Optionally, the ISCOMS 
may be devoid of additional detergent See WO00/07621. 

10 A review of the development of saponin based adjuvants can be found at Bair, et aL, 

"ISCOMs and other saponin based adjuvants", Advanced Drug Delivery Reviews (1998) 
32:247-271. See also Sjolander, et al, "Uptake and adjuvant activity of orally delivered saponin 
and ISCOM vaccines", Advanced Drug Delivery Reviews (1998) 32:321-338. 
D. Bacterial or Microbial Derivatives 

5 Adjuvants suitable for use in the invention include bacterial or microbial derivatives such 

as: 

(1) Non-toxic^ derivatives of enterobacterial lipopolysaccharide (LPS) 

Such derivatives include Monophosphoryl lipid A (MPL) and 3-O-deacylated MPL 
(3dMPL). 3dMPL is a mixture of 3 De-O-acylated monophosphoryl lipid A with 4, 5 or 6 
0 acylated chains. A preferred "small particle" form of 3 De-O-acylated monophosphoryl lipid A 
is disclosed in EP 0 689 454. Such '.'small particles" of 3dMPL are small enough to be sterile 
filtered through a 0.22 micron membrane (see EP 0 689 454). Other non-toxic LPS derivatives 
include monophosphoryl lipid A mimics, such as aminoalkyl glucosaminide phosphate 
derivatives e.g. RC-529. See Johnson et al. (1999) Bioorg Med Chem Lett 9:2273-2278. 
5 (2) Lipid A Derivatives 

Lipid A derivatives include derivatives of lipid A from Escherichia coli such as OM-174. 
OM-174 is described for example in Meraldi et aL, "OM-174, a New Adjuvant with a Potential 
for Human Use, Induces a Protective Response with Administered with the Synthetic C- 
Terminal Fragment 242-310 from the rircumsporozoite protein of Plasmodium berghei", 
) Vaccine (2003) 21:2485-2491; and Pajak,ef al., "The Adjuvant OM-174 induces both the 
migration and maturation of murine dendritic cells in vivo", Vaccine (2003) 21:836-842. 
(3) Immunostimulatory oligonucleotides 

Immunostimulatory oligonucleotides suitable for use as adjuvants in the invention include 
nucleotide sequences containing a CpG motif (a sequence containing an unmethylated cytosine 
! followed by guanosine and linked by a phosphate bond). Bacterial double stranded RNA or 
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oligonucleotides containing palindromic or poly(dG) sequences have also been shown to be 
immunostimulatory. 

The CpG's can include nucleotide modifications/analogs such as phosphorothioate 
modifications and can be double-stranded or single-stranded. Optionally, the guanosine may be 

5 replaced with an analog such as 2 , -deoxy-7-deazaguanosine. See Kandimalla, et al. y 'Divergent 
synthetic nucleotide motif recognition pattern: design and development of potent 
immunomodulatory oligodeoxyribonucleotide agents with distinct cytokine induction profiles", 
Nucleic Acids Research (2003) 31(9): 2393-2400; WO 02/26757 and WO 99/62923 for 
examples of possible analog substitutions. The adjuvant effect of CpG oligonucleotides is further 

10 discussed in Krieg, "CpG motifs: the active ingredient in bacterial extracts?", Nature Medicine 
(2003) 9(7): 831-835; McCIuskie, et oi, "Parenteral and mucosal prime-boost immunization 
strategies in mice with hepatitis B surface antigen and CpG DNA", HEMS Immunology and 
Medical Microbiology (2002) 32:179-185; WO 98/40100; US Patent No. 6,207,646; US Patent 
No. 6,239,116 and US Patent No. 6,429,199. 

15 The CpG sequence may be directed to TLR9, such as the motif GTCGTT or TTCGTT. See 

Kandimalla, et al. 9 "Toll-like receptor 9: modulation of recognition and cytokine induction by 
novel synthetic CpG DNAs", Biochemical Society Transactions (2003) 31 (part 3): 654^658. The 
CpG sequence may be specific for inducing a Thl immune response, such as a CpG-A ODty or 
it may be more specific for inducing a B cell response, such a CpG-B ODN. CpG-A and CpG-B 

10 ODNs are discussed in Blackwell, et al. , "CpG- A-Induced Monocyte IFN-gamma-Inducible 
Protein-10 Production is Regulated by Plasmacytoid Dendritic Cell Derived IFN-alpha", J. 
Immunol. (2003) 17Q(8):4061-4068; Krieg, "From A to Z on CpG", TRENDS in Immunology 
(2002) 23(2): 64-65 and WO 01/95935, Preferably, the CpG is a CpG-AODN. 

Preferably, the CpG oligonucleotide is constructed so that the 5* end is accessible for 

15 receptor recognition. Optionally, two CpG oligonucleotide sequences may be attached at their 3' 
ends to form "immunomers". See, for example, Kandimalla, et aL 9 "Secondary structures in CpG 
oligonucleotides affect immunostimulatory activity", BBRC (2003) 306:948-953: Kandimalla, et 
aL 9 "Toll-like receptor 9: modulation of recognition and cytokine induction by novel synthetic 
GpG DNAs", Biochemical Society Transactions (2003) 31(part 3):664-658; Bhagat et dL y "CpG 

JO penta- and hexadeoxyribonucleotides as potent immunomodulatory agents" BBRC (2003) 
300:853-861 and WO 03/035836. 

(4) ADP-ribosylating toxins and detoxified derivatives thereof. 

Bacterial ADP-ribosylating toxins and detoxified derivatives thereof may be used as 
adjuvants in the invention. Preferably, the protein is derived from E. coli (i.e., E. coli heat labile 

5 enterotoxin "LT), cholera ("CT"), or pertussis ("PT"). The use of detoxified ADP-ribosylating 
toxins as mucosal adjuvants is described in WO 95/17211 and as parenteral adjuvants in WO 
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98/42375. Preferably, the adjuvant is a detoxified LT mutant such as LT-K63, LT-R72, and 
LTR192G. The use of ADP-ribosylating toxins and detoxified derivaties thereof, particularly 
LT-K63 and LT-R72, as adjuvants can be found in the following references, each of which is 
specifically incorporated by reference herein in their entirety: Beignon, et aL, "The LTR72 
Mutant of Heat-Labile Enterotoxin of Escherichia coli Enahnces the Ability of Peptide Antigens 
to Elicit CD4+ T Cells and Secrete Gamma Interferon after Coapplication onto Bare Skin", 
Infection and Immunity (2002) 70(6):3012-3019; Pizza, et aL, "Mucosal vaccines: non-toxic 
derivatives of LT and CT as mucosal adjuvants", Vaccine (2001) 19:2534-2541; Pizza, et aL , 
"LTK63 and LTR72, two mucosal adjuvants ready for clinical trials" Int J. Med. Microbiol 
(2000) 29Q(4-5):455^61; Scharton-Kersten etaL, 'Transcutaneous Immunization with Bacterial 
ADP-Ribosylating Exotoxins, Subunits and Unrelated Adjuvants", Infection and Immunity 
(2000) 68(9):5306-5313; Ryan et aL, "Mutants of Escherichia coli Heat-Labile Toxin Act as 
Effective Mucosal Adjuvants for Nasal Delivery of an Acellular Pertussis Vaccine: Differential 
Effects of the Nontoxic AB Complex and Enzyme Activity on Thl and Th2 Cells" Infection and 
Immunity (1999) 67(12):6270-6280; Partidos etaL, "Heat-labile enterotoxin of Escherichia coli 
and its site-directed mutant LTK63 enhance the proliferative and cytotoxic T-cell responses to 
intranasaUy co-immunized synthetic peptides", Immunol. Lett. (1999) 67(3):209-216; Peppoloni 
et aL, "Mutants of the Escherichia coli heat-labile enterotoxin as safe and strong adjuvants for . 
intranasal delivery of vaccines", Vaccines (2003) 2(2):285-293; and Pine et aL, (2002) 
"Intranasal immunization with influenza vaccine and a detoxified mutant of heat labile 
enterotoxin from Escherichia coli (LTK63)" J. Control Release (2002) 85(l-3):263-270. 

Numerical reference for amino acid substitutions is preferably based on the alignments of the A 
and B subunits of ADP-ribosylating toxins set forth in Domenighini et al., Mol. Microbiol 

(1995) 15(6):1 165-1167, specifically incorporated herein by reference in its entirety. 

E. Human Tmmnnnrnodulators 

Human immunomodulators suitable for use as adjuvants in the invention include cytokines, 

such as interleukins (e.g. JL-1, JL-2, IL4, BL-5, IL-6, IL-7, IL-12, etc.), interferons (e.g. 

interferon^), macrophage colony stimulating factor, and tumor necrosis factor. 

P. Bioadhesives and Mucoadhesives 

Bioadhesives and mucoadhesives may also be used as adjuvants in the invention. Suitable 

bioadhesives include esterified hyaluronic acid microspheres (Singh et al. (2001) /. Cont. Rele. 

70:267-276) or mucoadhesives such as cross-linked derivatives of poly(acrylic acid), polyvinyl 

alcohol, polyvinyl pyrollidone, polysaccharides and carboxymethylcellulose. Chitosan and 

derivatives thereof may also be used as adjuvants in the invention. Kg., WO99/27960. 
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G. Microparticles 

Microparticles may also be used as adjuvants in the invention. Microparticles (Le. a 
particle of -lOOnm to ~150/xm in diameter, more preferably ~200nm to -30/tm in diameter, and 
most preferably ~500nm to ~10/xm in diameter) formed from materials that are biodegradable 
and non-toxic (e.g. a poly(a-hydroxy acid), a polyhydroxybutyric acid, a polyorthoester, a 
polyanhydride, a polycaprolactone, etc.), with polyOactide-co-glycolide) are preferred, 
optionally treated to have a negatively-charged surface (e.g. with SDS) or a positively-charged 
surface (e.g. with a cationic detergent, such as CTAB). 

H. Liposomes 

Examples of liposome formulations suitable for use as adjuvants are described in US 
Patent No. 6,090,406, US Patent No. 5,916,588, and EP 0 626 169. 

I. PolYOXvethylene ether and Polvoxvethvl^e Ester Formulations 

Adjuvants suitable for use in the invention include polyoxyethylene ethers and 
polyoxyethylene esters. W099/52549. Such formulations further include polyoxyethylene 
sorbitan ester surfactants in combination with an octoxynol (WO01/21207) as well as 
polyoxyethylene alkyl ethers or ester surfactants in combination with at least one additional non- . 
ionic surfactant such as an octoxynol (WO01/21152). 

Preferred polyoxyethylene ethers are selected from the following group: polyoxyethylene- 
9-lauryl ether (laureth 9), polyoxyethylene-9-steoryl ether, polyoxytheylene-8-steoryl ether, 
polyoxyefoylene-4-lauryl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl 
ether. 

J. Polvphosphazene fPCPP) 

PCPP formulations are described, for example, in Andrianov et al, "Preparation of 
hydrogel microspheres by coacervation of aqueous polyphophazene solutions", Biomaterials 
(1998) 19(1-3):109-115 and Payne et aL, "Protein Release from Polyphosphate Matrices", 
Adv. Drug. Delivery Review (1998) 31(3): 185-196. 
K. Muramyl pep tides 

Examples of muramyl peptides suitable for use as adjuvants in the invention include N- 
acetyl-muramyl-I.tlireonyl-D-isoglutaniine (thr-MDP), N-acetyl-nonnuramyl-L-alanyl-D- 
isoglutamine (nor-MDP), andN-acetylmuramyl-i.alanyl-^isoglutaimny^ 
mpalmitoyl-OT-gly M ro-3-hydroxyphosphoiyloxy>emylamm MTP-PE). 

L. TmiHa zoouinolone Comp minds 

Examples of imidazoquinolone compounds suitable for use adjuvants in the invention 
include Imiquamod and its homologues, described further in Stanley, "Lniquimod and the 
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imidazoquinolones: mechanism of action and therapeutic potential" Clin Exp Dermatol (2002) 
27(7):571-577 and Jones, "Resiquimod 3M", Curr Opin Investig Drugs (2003) 4(2):214-218. 
ML Virosomes a nd Virus Ti ke Particles (VLPs) 

Virosomes and Virus like Particles (VLPs) can also be used as adjuvants in the invention. 

5 These structures generally contain one or more proteins from a virus optionally combined or 
formulated with a phospholipid They are generally non-pathogenic, non-replicating and 
generally do not contain any of the native viral genome. The viral proteins may be recombinantly 
produced or isolated from whole viruses. These viral proteins suitable for use in virosomes or 
VLPs include proteins derived from influenza virus (such as HA or NA), Hepatitis B virus (such 

10 as core or capsid proteins), Hepatitis E virus, measles virus, Sindbis virus, Rotavirus, Foot-and- 
Mouth Disease virus, Retrovirus, Norwalk virus, human Papilloma virus, HIV, RNA-phages, 
QB-phage (such as coat proteins), GA-phage, fr-phage, AP205 phage, and Ty (such as 
retrotransposon Ty protein pi). VLPs are discussed further in WO 03/024480, WO 03/024481, 
and Niikura et at, "Chimeric Recombinant Hepatitis E Virus-like Particles as an Oral Vaccine 

15 Vehicle Presenting Foreign Epitopes", Virology (2002) 293:273-280; Lenz et ai, 

"PapMomarivurs-Iike Particles Induce Acute Activation of Dendritic Cells", Journal of 
Immunology (2001) 5246-5355; Pinto, et c£ , "Cellular Immune Responses to Human 
Papillomavirus (HPV>16 LI Healthy Volunteers Immunized with Recombinant HPV-16 LI 
Virus-like Particles", Journal of Infectious Diseases (2003) 188:327-338; and Gerber et aL y 

10 "Human Papillomavrisu Virus-like Particles Are Efficient Oral Immunogenswhen 

Coadministered with Escherichia coli Heat-Labile Entertoxin Mutant R192G or CpG", Journal 
of Virology (2001) 75(10):4752-4760. Virosomes are discussed further in, for example, Gluck 
et al., "New Technology Platforms in the Development of Vaccines for the Future", Vaccine 
(2002)20:B10-B16. 

5 The invention may also comprise combinations of aspects of one or more of the adjuvants 

identified above. For example, the following adjuvant compositions may be used in the 
invention: 

(1) a saponin and an oil-in-water emulsion (W099/11241); 

(2) a saponin (e.g.. y QS21) + a non-toxic LPS derivative (e.g. 9 3dMPL) (see WO 
0 94/00153); 

(3) a saponin (e.g.., QS21) + a non-toxic LPS derivative (e.g. 9 3dMPL) + a cholesterol; 

(4) a saponin (e.g. QS21) + 3dMPL + IL12 (optionally + a sterol) (W098/57659); 

(5) combinations of 3dMPL with, for example, QS21 and/or oil-in-water emulsions (See 
European patent applications 0835318, 0735898 and 0761231); 
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(6) SAP, containing 10% Squalane, 0.4% Tween 80, 5% pluronic-block polymer L121, 
and thr-MDP, either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion. 

(7) Ritri™ adjuvant system (RAS), (Ribi Immunochem) containing 2% Squalene, 0.2% 
5 Tween 80, and one or more bacterial cell wall components from the group consisting of 

monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), 
preferably MPL + CWS (Detox™); and 

(8) one or more mineral salts (such as an aluminum salt) + a non-toxic derivative of LPS 
(such as 3dPML). 

10 Aluminium salts and MF59 are preferred adjuvants for parenteral immunisation. Mutant 

bacterial toxins are preferred mucosal adjuvants. 

As mentioned above, adjuvants suitable for use in the invention may also include one or 
more of die following: 

- Kcoli heat-labile enterotoxin ("LT"), or detoxified mutants thereof, such as the K63 or 
.5 R72 mutants; . 

- cholera toxin ("CT"), or detoxified mutants thereof; 

. . - microparticles (i.e., a particle of ~100nm to ~150/im in diameter, more preferably 
~200nm to ~30jim in diameter, and most preferably ~500nm to ~10/nn in diameter) formed from 
materials that are biodegradable and non-toxic (e.g. a poly(a-hydroxy acid), a 
0 polyhydroxybutyric acid, a polyorthoester, a polyanhydride, a polycaprolactone etc.); 

- a polyoxyethylene ether or a polyoxyethylene ester (see International patent application 
WO 99/52549); 

- a polyoxyethylene sorbitan ester surfactant in combination with an octoxynol (see 
International patent application WO 01/21207) or a polyoxyethylene alkyl ether or ester 

5 surfactant in combination with at least one additional non-ionic surfactant such as an octoxynol 
(see International patent application WO 01/21152); 

- chitosan (e.g. International patent application WO 99/27960) 

- an immunostimulatory oligonucleotide (e.g. a CpG oligonucleotide) and a saponin (see 
International patent application WO 00/62800) 

) - immunostimulatory double stranded RNA. 

- aluminum compounds (e.g. aluminum hydroxide, aluminum phosphate, aluminum 
hydroxyphosphate, oxyhydroxide, orthophosphate, sulfate etc. (e.g. see chapters 8 & 9 of 
Vaccine design: the submit and adjuvant aproach, eds. Powell & Newman, Plenum Press 1995 
(ISBN 0-306-44867-X) (hereinafter '"Vaccine design"), or mixtures of different aluminum 
compounds, with the compounds taking any suitable form (e.g. gel, crystalline, amorphous etc.), 
and with adsorption being preferred; 
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- MF59 (5% Squalene, 0.5% Tween 80, and 0.5% Span 85, formulated into submicron 
particles using a microfluidizer) (see Chapter 10 of Vaccine design; see also International patent 
application WO 90/14837); 

- liposomes (see Chapters 13 and 14 of Vaccine design); 
5 - ISCOMs (see Chapter 23 of Vaccine design); 

- SAP, containing 10% Squalane, 0.4% Tween 80, 5% pluroiiic-block polymer L121, and 
thr-MDP, either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion (see Chapter 12 of Vaccine design); 

- Ribi™ adjuvant system (RAS), (Ribi Immunochem) containing 2% Squalene, 0.2% 
10 Tween 80, and one or more bacterial cell wall components from the group consisting of 

monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), 
preferably MPL + CWS (Detox™); 

- saponin adjuvants, such as QuilA or QS21 (see Chapter 22 of Vaccine design), also 
known as Stimulon™; 

15 -ISCOMs, which may be devoid of additional detergent (WO 00/07621); 

- complete Freund's adjuvant (CPA) and incomplete Freund's adjuvant (IF A); 

- cytokines, such as interleukins (e.g. JLrl, IL-2, IL4, IL-5, IL-6, IL-7, IW2, etc.), 
interferons (e.g. interferon-^), macrophage colony stimulating factor, tumor necrosis factor, etc. 
(see Chapters 27 & 28 of. Vaccine design); 

fl - monophosphoryl lipid A (MPL) or 3-O-deacylated MPL (3dMPL) (e.g. chapter 21 of 

Vaccine design); 

- combinations of 3dMPL with, for example, QS21 and/or oil T in-water emulsions 
(European patent applications 0835318, 0735898 and 0761231); 

- oligonucleotides comprising CpG motifs (see Krieg (2000) Vaccine, 19:618-622; Krieg 
5 (2001) Curr. Opin. Mol. Ther., 2001, 3:15-24; WO 96/02555, WO 98/16247, WO 98/18810, WO 

98/40100, WO 98/55495, WO 98/37919 and WO 98/52581, etc.) Le. containing at least one CG 
dinucleotide, 

- a polyoxyethylene ether or a polyoxyethylene ester (International patent application 
W099/52549); 

3 - a polyoxyethylene sorbitan ester surfactant in combination with an octoxynol 

(International patent application WO 01/21207) or a polyoxyethylene alkyl ether or ester 
surfactant in combination with at least one additional non-ionic surfactant such as an octoxynol 
(WO 01/21 152); 

- an immunostimulatory oligonucleotide (e.g. a CpG oligonucleotide) and a saponin 
> (WO00/62800); 
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- an immunostimulant and a particle of metal salt (International patent application 
WO00/23105); 

- a saponin and an oil-in-water emulsion (WO 99/11241); 

- a saponin (e.g. QS21) + 3dMPL + IW2 (optionally + a sterol) (WO 98/57659). 
Other adjuvants suitable for mucosal or parenteral administration are also available (e.g. 

see chapter 7 of Vaccine design: the suburdt and adjuvant aproach, eds. Powell & Newman, 
Plenum Press 1995 (ISBN 0-30£44867-X). 

Mutants of LT are preferred mucosal adjuvants, in particular the "K63" and iC R72" mutants 
(e.g. see International patent application WO 98/18928), as these result in an enhanced immune 
response. 

Microparticles are also preferred mucosal adjuvants. These are preferably derived from a 
poly(a-hydroxy acid), in particular, from a polyflactide) ('TLA"), a copolymer of D,L-lactide 
and glycolide or gjycolic acid, such as a poly(D JL-lactide-co»glycolide) ("PLG" or "PLGA"), or 
a copolymer of DJ^lactide and caprolactone. Hie microparticles may be derived from any of 
various polymeric starting materials which have a variety of molecular weights and, in the case 
of the copolymers such as PLG, a variety of lactiderglycolide ratios, the selection of which will 
be largely a matter of choice, depending in part on the coadministered antigen. 

The SARS virus (inactivated or attenuated), viral antigens, antibodies or adjuvants of the 
invention may be entrapped within the microparticles, or may be adsorbed to them. Entrapment 
within PLG microparticles is preferred. PLG microparticles are discussed in further detail in 
Morris et al. 9 (1994), Vaccine, 12:5-11, in chapter 13 of Mucosal Vaccines, eds. Kiyono et aL 9 
Academic Press 1996 (ISBN 012410587), and in chapters 16 & 18 of Vaccine design: • the 
subunit and adjuvant aproach, eds. Powell & Newman, Plenum Press 1995 (ISBN 0-306-44867^ 
X). 

LT mutants may advantageously be used in combination with microparticle-entrapped 
antigen, resulting in significantly enhanced immune responses. 

Aluminium compounds and MF59 are preferred adjuvants for parenteral use. 
The composition may include an antibiotic. 

The immunogenic compositions of the invention may be administered in a single dose, or 
as part of an administration regime. The regime may include priming and boosting doses, which 
may be administered mucosally, parenterally, or various combinations thereof. 

The methods of the invention further comprise treating or preventing a SARS virus-related 
disease by administering to an animal a composition comprising an effective amount of the 
antibodies of the invention. An "effective amount* * of the antibodies of the invention is an 
amount sufficient to provide passive immunization protection or treatment to the animal. 
Preferably, the antibodies of the invention are specific to the SARS viral antigen. 
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Methods of treatment may combine both immunogenic compositions and antibody 
compositions. Accordingly the invention comprises a method for treating or preventing a SARS 
virus-related disease comprising administering an immunogenic composition comprising an 
immunologically effective amount of a SARS viral antigen and administering an effective 
5 amount of antibodies specific to SARS viral antigen. The immunogenic composition and the 
antibodies may be administered together or separately. The invention further comprises a 
composition comprising an immunogenic composition comprising an immunologically effective 
amount of a SARS viral antigen and further comprising an effective amount of antibodies 
specific to a SARS viral antigen. 
10 The SARS viral antigens and antibodies of the invention may also be administered in 

polynucleotide form. The SARS viral antigens and/or antibody proteins are then expressed in 
vivo. 

The SARS viral antigens and the antibodies of the invention can also be delivered using 
one or more gene vectors, administered via nucleic acid immunization or the like using- standard 

15 gene delivery protocols. Methods for gene delivery are known in the art See, e.g. y US Patent 
Nos. 5,399346, 5,580,859, 5,589,466. The constructs can be delivered (e.g., injected) either ■ 
subcutaneously, epidermally, intradennally, intramuscularly, intravenous, mucosally (such as 
nasally, rectally and vaginally), intraperitoneally, orally or combinations thereof. Intramuscular 
injection of 25f*g plasmid DNA encoding spike antigens, in 200/il PBS pH 7.4, at weeks 0, 3 and 

20 . 6, has been described for mice by Yang et al. (2004) Nature 428:561-564. 

An exemplary replication-deficient gene delivery vehicle that may be used in the practice 
of the present invention is any of the alphavirus vectors, described in, for example, US Patent 
Nos. 6342372; 6,329,201 and International Publication WO 01/92552. 

A number of viral based systems have been developed for gene transfer into mammalian 

25 cells. For example, retroviruses provide a convenient platform for gene delivery systems. 
Selected sequences can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombinant virus can then be isolated and delivered to cells of 
the subject either in vivo or ex vivo. A number of retroviral systems have been described (US 
Patent No. 5,219,740; Miller & Rosman, BioTechniques (1989) 7:980-990; Miller, A.D., Human 

30 Gene Therapy (1990) 1:5-14; Scarpa et al., Virology (1991) 180:849-852; Burns et at, Proc. 
Natl Acad. Sci. USA (1993) 90:8033-8037; andBoris-Lawrie & Temin, Cur. Opin. Genet. 
Develop. (1993) 3:102-109. 

A number of adenovirus vectors have also been described. Unlike retroviruses which 
integrate into the host genome, adenoviruses persist extrachromosomally thus minimizing the 

J5 risks associated with insertional mutagenesis (Haj-Ahmad and Graham, /. Virol (1986) 57:267- 
274; Bett et al, J. Virol (1993) 67:591 1-5921; Mittereder et al, Human Gene Therapy (1994) 
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5:717-729; Seth et aL, J. Virol (1994) 68:933-940; Bair et aL, Gene Therapy (1994) 1:51-58; 
Beikner, KJL BioTecfmiques (1988) 6:616-629; and Rich et aL, Human Gene Therapy (1993) 
4:461-476). Adenoviral delivery of codon-optimsed versions of the genes encoding SARS 
coronavirus structural antigens spike SI, membrane protein and nucleocapsid protein has been 

5 investigated in rhesus macaques and found to invoke a strong neutralizing antibody response 
(Gao et aL (2003) Lancet 362(9399):1895-1896). 

Additionally, various adeno-associated virus (AAV) vector systems have been developed 
for gene delivery. AAV vectors can be readily constructed using techniques well known in the 
art See, e.g., US Patent Nos. 5,173,414 and 5,139,941; International Publication Nos. WO 

10 92/01070 (published 23 January 1992) and WO 93/03769 (published 4 March 1993); Lebkowski 
et aL, Molec. Cell Biol (1988) 8:3988-3996; Vincent et aL , Vaccines 90 (1990) (Cold Spring 
Harbor Laboratory Press); Carter, BJ. Current Opinion in Biotechnology (1992) 3:533-539; 
Muzyczka, Nv Current Topics in Microbiol and Immunol (1992) 158:97-129; Kotin, RM. 
Human Gene Therapy (1994) 5:793-801; Shelling and Smith, Gene Therapy (1994) 1:165-169; 

L5 and Zhou et aL, J. Exp. Med (1994) 179:1867-1875. : 

Another vector system useful for delivering polynucleotides, mucosally and otherwise, is 
the enterically administered recombinant poxvirus vaccines described by Small, Jr., P.A., et aL 
(US Patent No. 5,676,950, issued October 14, 1997, herein incorporated by reference) as well as 
the vaccinia virus and avian poxviruses. By way of example, vaccinia virus recombinants 

10 expressing the genes can be constructed as follows. The DNA encoding the SARS antigen or 
antibody or antibody coding sequence is first inserted into an appropriate vector so that it is 
adjacent to a Vaccinia promoter and flanking vaccinia DNA sequences, such as the sequence 
encoding thymidine kinase (TK). This vector is then used to transfect cells that are 
simultaneously infected with vaccinia. Homologous recombination serves to insert the vaccinia 

55 promoter plus the gene encoding the coding sequences of interest into the viral genome. The 
resulting TK recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, can also be used 
to deliver genes encoding the SARS viral antigens or antibodies of the invention. Recombinant 

•0 avipox viruses, expressing immunogens from mammalian pathogens, are known to confer 
protective immunity when administered to non-avian species. The use of an avipox vector is 
particularly desirable in human and other mammalian species since members of the avipox genus 
can only productively replicate in susceptible avian species and therefore are not infective in 
mammalian cells. Methods for producing recombinant avipoxviruses are known in the art and 

5 employ genetic recombination, as described above with respect to the production of vaccinia 



-174- 



WO 2004/092360 



PCT/US2004/011710 



viruses. See, e.g. 9 WO 91/12882; WO 89/03429; and WO 92/03545. Picomavirus-derived 
vectors can also be used (See, e.g., US Patent Nos. 5,614,413 and 6,063384). 

Molecular conjugate vectors, such as the adenovirus chimeric vectors described in Michael 
et al, J. Biol Chenu (1993) 268:6866-6869 and Wagner et oi, Proc. Natl Acad. Set USA 

5 (1992) 89:6099-6103, can also be used for gene delivery. 

A vaccinia based infection/transfection system can be conveniently used to provide for 
inducible, transient expression of the coding sequences of interest (for example, a SARS viral 
antigen or antibody expression cassette) in a host cell. In this system, cells are first infected in 
vitro with a vaccinia virus recombinant that encodes the bacteriophage 17 RNA polymerase. 

10 This polymerase displays exquisite specificity in that it only transcribes templates bearing 17 
promoters. Following infection, cells are transfected with the polynucleotide of interest, driven 
by a 17 promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant transcribes the transfected DNA into RNA that is then translated into protein by the . 
host translational machinery. The method provides for high level, transient, cytoplasmic 

15 production of large quantities of RNA and its translation products. See, e.g., Elroy-Stein and 
Moss, Proc. Natl Acad Sci. USA (1990) 87:6743-6747; Fuerst et at, Proc. Natl Acad Sci. USA 
(1986) 83:8122^8126. 

As an alternative approach to infection with vaccinia or avipox virus recombinants, or to 
the delivery of genes using other viral vectors, an amplification system can be used that will lead 

10 to high level expression following introduction into host cells. Specifically, a T7 RNA 

polymerase promoter preceding the coding region for 17 RNA polymerase can be engineered 
Translation of RNA derived from this template will generate 17 RNA polymerase that in turn 
will transcribe more template. Concomitantly, there will be a cDNA whose expression is under 
the control of the 17 promoter. Thus, some of the 17 RNA polymerase generated from 

15 translation of the amplification template RNA will lead to transcription of the desired gene. 

Because some 17 RNA polymerase is required to initiate the amplification, 17 RNA polymerase 
can be introduced into cells along with the template(s) to prime the transcription reaction. The 
polymerase can be introduced as a protein or on a plasmid encoding the RNA polymerase. For a 
further discussion of 17 systems and their use for transforming cells, see, e.g., International 

i0 Publication No. WO 94/26911; Studier andMoffatt, /. Mol Biol (1986) 189:113-130; Deng and 
Wolff, Gene (1994) 143:245-249; Gao et al. 9 Biochem. Biopkys. Res. Common. (1994) 
200:1201-1206; Gao and Huang, Nuc. Acids Res. (1993) 21:2867-2872; Chen et oi, Nuc. Acids 
Res. (1994) 22:2114-2120; and US Patent No. 5,135,855. 

The immunogenic compositions of the invention may further comprise diluents, such as 

•5 water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such as wetting or 
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emulsifying agents, pH buffering substances, and the like may be included in the immunogenic 
composition. 

The immunogenic compositions used in the invention can be administered to an animal. 
Animals suitable for use in the methods of the invention include humans and other primates, 
including non-human primates such as chimpanzees, and other apes and monkey species; farm 
animals such as cattle, sheep, pigs, goats and horses, domestic animals such as dogs and cats; 
laboratory animals including rodents such as mice, rats and guinea pigs; birds, including 
domestic, wild and game birds such as chickens, turkeys and other gallinaceous birds, ducks, 
geese and the like. Animals suitable for use in die invention can be of any age, including both 
adult and newborn. Transgenic animals can also be used in the invention. 

The immunogenic compositions of the invention can be used to treat or prevent SARS 
virus-related diseases. 

The compositions of the invention are preferably pharmaceutically acceptable and 
pharmacologically acceptable. In particularly, the compositions afe preferably not biologically 
or otherwise undesirable, i.e. t the material may be administered to an individual in a formulation 
or composition without causing any undesirable biological effects or interacting in a deleterious 
manner with any of the components of the composition in which it is contained 

Pharmaceutically acceptable salts can also be used in compositions of the invention, for 
example, mineral salts such as hydrochlorides, hydrobromides, phosphates, or sulfates, as well as 
salts of organic acids such as acetates, proprionates, malonates, or benzoates. Especially useful 
protein substrates are serum albumins, keyhole limpet hemocyanin, immunoglobulin molecules, 
thyroglobulin, ovalbumin; tetanus toxoid, and other proteins well known to those of skill in the 
art Compositions of the invention can also contain liquids or excipients, such as water, saline, 
glycerol, dextrose, ethanol, or the like, singly or in combination, as well as substances such as 
wetting agents, emulsifying agents, or pH buffering agents. Liposomes can also be used as a 
earner for a composition of the invention. 

SARS specific reagents and analytical assays may be used in the manufacture and testing 
of the vaccines of the invention. Such analytical assays include, for example: 1) virus titration 
and plaque assays for quantitation of infectious vims particles, 2) a neutralization assay with 
constant virus and varying serum dilutions, 3) a two step RT-PCR system (Light Cycler-Roche) 
for detection of negative strand viral RNA, with the target sequence located within the N gene, 
providing highest possible sensitivity, and 4) ELBA and western blot assays for detection and 
qualification of viral proteins. 

In addition, rabbit polyclonal antiserum has been generated to obtain antibody reagents 
(and demonstrate induction of neutralizing antibodies) against the SARS-CoV. A sample 
protocol for generating such reagents is set forth below. The virus is first cultivated in suitable 
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cell culture, such as Vero cells, and pelleted through a 20% sucrose (w/v) cushion. The pellet is 
then subjected to a glycerol potassium-tartrate gradient for further purification. The virus- 
containing fraction is then diluted and pelleted by dtracentrifugation. The pellet is then 
dissolved in PBS and the virus is inactivated with C3H4O2 (beta-propiolactone, BPL). Two 
5 rabbits are immunized subcutaneously (SC) on day 0, 14, and 28 with lxlO 9 inactivated viral 
particles mixed with IFA as adjuvant Rabbits are bled on days 0 (pre-inoculation), 13, 28, and 
35 (1 week after 3rd immunization). Sera obtained from this protocol were tested for their 
reactivity against SARS-CoV proteins in western blots and found to react with the major 
structural proteins spike (S), membrane (M), and nucleocapsid (N). 
10 /. Emerging coronavirus vaccines 

The SARS epidemic has lead to increased awareness of viral infections caused by 
coronaviruses. The vaccines of the invention may be adapted to prevent or treat emerging strains 
of coronavirus, including emerging strains of 'SARS virus. 

The invention provides a vaccine comprising an inactivated (or killed) human coronavirus, 
.5 an attenuated human coronavirus, a split human coronavirus preparation, or a recombinant or 
purified subunit formulation of one or more antigens from a human coronavirus, wherein the 
human coronavirus is not the SARS coronavirus. Optionally, the human coronavirus is not the 
229E coronavirus. Optionally, the human coronavirus is not the OC43 coronavirus. Optionally, 
the human coronavirus is not the NL63 coronavirus. Thus the invention provides a vaccine as 
0 defined above, wherein the human coronavirus is not the SARS coronavirus, is not the 229E 
coronavirus, is not the OC43 coronavirus and is not the NL63 coronavirus. Such vaccines are 
useful for preventing and/or treating emerging human coronavirus infections. 

The invention also provides a vaccine comprising: (a) an inactivated (or killed) human 
coronavirus, an attenuated human coronavirus, a split human coronavirus preparation, or a 
5 recombinant or purified subunit formulation of one or more antigens from a human coronavirus, 
wherein the human coronavirus is not the SARS coronavirus, as defined above; and (b) an 
inactivated (or killed) human coronavirus, an attenuated human coronavirus, a split human 
coronavirus preparation, or a recombinant or purified subunit formulation of one or more 
antigens from a human coronavirus, wherein the human coronavirus is the SARS coronavirus. 
) Such vaccines are useful for preventing and/or treating both SARS and other human 
coronaviruses. 

As well as providing vaccines comprising antigens from more than one type of 
coronavirus, the invention also provides vaccines comprising antigens from more than one strain 
of the same coronavirus e.g. different strains of the SARS coronavirus, or different strains of a 
: coronavirus other than the SARS coronavirus. In one embodiment, the vaccine comprises 
antigens from at least two strains of coronavirus, or at least three strains of coronavirus. In one 
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embodiment, the vaccine comprises antigens from at least two types of coronavinis. In one 
embodiment, the vaccine comprises at least one antigen from each of the known types of 
coronaviiuses (type I, type II and type HI). Such vaccines follow the model of current influenza 
vaccines. 

Hie selection of coionaviruses and/or coronavinis strains for use in vaccines of the 
invention can be based on various criteria. For instance, selection may be based on viruses and/or 
strains that have been detected in the geographical region (e.g. northern or southern hemisphere, 
a particular country, etc.) where the vaccine targeted. Selection may be based on the results of 
animal surveillance e.g. of viruses detected in cat populations. Selection may be based on the 
results of clinical surveillance e.g. of viruses detected in patients hospitalized with respiratory 
infection. Selection may be performed every year e.g. prior to winter. Vaccines may also be 
administered yearly, again following the model of current influenza vaccines. 

Preferred vaccines are sufficiently immunogenic to provide a neutralizing immune 
response, and more preferably a protective and/or therapeutic immune response. Particularly 
preferred vaccines meet the efficacy requirements that may be specified by the WHO from time 
to time. 

A preferred subunit antigen for inclusion in vaccines of the invention is a purified spike 
protein, more preferably in oligomeric (e.g. trimeric) form. The spike protein may or my not be . 
cleaved e.g. into its SI and S2 products. 

The techniques disclosed above for selecting viruses and/or strains for production of 
vaccines can also be used to select appropriate viruses and/or strains from which HR1 and HR2 
sequences can be obtained for providing therapeutic peptides, as disclosed above. 

UI. DIAGNOSTIC COMPOSITIONS AND METHODS OF THE INVENTION 

The invention provides methods for detecting the SARS coronavinis. Detection in patient 
samples can be used to detect and diagnose infections by the virus. Detection in donated blood 
can be used to prevent inadvertent transmission of the virus during blood transplant procedures 
Detection methods fall into three main categories: detection of SARS virus nucleic acids; 
detection of SARS virus proteins; and detection of anti-SARS virus immune responses. Hie 
invention provides all such methods. 

As used herein when referring to nucleotide sequences, particularly oligonucleotide probes 
and primers, "similar" sequences includes those sequences that are at least 90% identical to 
known SARSV genomic sequence and includes sequences that are at least 95 % identical, at 
least 99% identical and 100% identical to the SARSV genomic sequence over the length of the 
probe or primer. 
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As used herein, the term "target nucleic acid region" or "target nucleic acid" denotes a * 
nucleic acid molecule with a "target sequence" to be amplified. Hie target nucleic acid may be 
either single-stranded or double-stranded and may include other sequences besides the target 
sequence, which may not be amplified. Hie term "target sequence" refers to the particular 

5 nucleotide sequence of the target nucleic acid which is to be amplified. The target sequence may 
include a probe-hybridizing region contained within the target molecule with which a probe will 
form a stable hybrid under desired conditions. The "target sequence" may also include the 
complexing sequences to which the oligonucleotide primers complex and be extended using the 
target sequence as a template. Where the target nucleic acid is originally single-stranded, the 

10 term "target sequence" also refers to the sequence complementary to the "target sequence" as 
present in the target nucleic acid. If the "target nucleic acid" is originally double-stranded, the 
term "target sequence" refers to both the plus (+) and minus (-) strands. 

The term "primer" or "oligonucleotide primer" as used herein, refers to an oligonucleotide 
which acts to initiate synthesis of a complementary DNA strand when placed under conditions in 

5 which synthesis of a primer extension product is induced ie. in the presence of nucleotides and a • 
polymerization-inducing agent such as a DNA or RNA polymerase and at suitable temperature, • 
pH, metal concentration, and salt concentration. The primer is preferably single-stranded for 
maximum efficiency in amplification, but may alternatively be double-stranded. If double- 
stranded, the primer is first treated to separate its strands before being used to prepare extension 

0 products. This denaturation step is typically eflFected by heat, but may alternatively be carried 
out using alkali, followed by neutralization. Thus, a "primer" is complementary to a template, 
and complexes by hydrogen bonding or hybridization with the template to give a primer/template: 
complex for initiation of synthesis by a polymerase, which is extended by the addition of 
covalently bonded bases linked at its 3' end complementary to the template in the process of 

5 DNA synthesis. 

As used herein, the term "probe" or "oligonucleotide probe" refers to a structure comprised 
of a polynucleotide, as defined above, that contains a nucleic acid sequence complementary to a 
nucleic acid sequence present in the target nucleic acid analyte. The polynucleotide regions of 
probes may be composed of DNA, and/or RNA, and/or synthetic nucleotide analogs. When an 
) "oligonucleotide probe" is to be used in a 5' nuclease assay, such as the TaqMan™ technique, the 
probe will contain at least one fluorescer and at least one quencher which is digested by the 5' 
endonuclease activity of a polymerase used in the reaction in order to detect any amplified target 
oligonucleotide sequences. In this context, the oligonucleotide probe will have a sufficient 
number of phosphodiester linkages adjacent to its 5* end so that the 5' to 3' nuclease activity 
employed can efficiently degrade the bound probe to separate the fluoresces and quenchers. 
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When an oligonucleotide probe is used in the TMA technique, it will be suitably labeled, as 
described below. 

It will be appreciated that the hybridizing sequences need not have perfect 
complementarity to provide stable hybrids, hi many situations, stable hybrids will form where 
5 fewer than about 10% of the bases are mismatches, ignoring loops of four or more nucleotides. 
Accordingly, as used herein the term "complementary" refers to an oligonucleotide that forms a 
stable duplex with its "complement" under assay conditions, generally where there is about 90% 
or greater homology. 

Hie terms "hybridize" and "hybridization" refer to the formation of complexes between 

10 nucleotide sequences which are sufficiently complementary to form complexes via Watson-Crick 
base pairing. Where a primer "hybridizes" with target (template), such complexes (or hybrids) 
are sufficiently stable to serve the priming function required by e.g. the DNA polymerase to 
initiate DNA synthesis. 

Stringent hybridization conditions will typically include salt concentrations of less than 

15 about 1 M, more usually less than about 500 mM and preferably less than about 200 mM. 
Hybridization temperatures can be as low as 5°C, but are typically greater than 22°C, more 
typically greater than about 30°C, and preferably in excess of about 37°C. Longer fragments 
may require higher hybridization temperatures for specific hybridization. Other factors may 
affect the stringency of hybridization, including base composition and length of the 

20 complementary strands, presence of organic solvents and extent of base mismatching, and the 
combination of parameters used is more important than the absolute measure of any one alone. 
Other hybridization conditions which may be controlled include buffer type and concentration, 
solution pH, presence and concentration of blocking reagents to decrease background binding 
such as repeat sequences or blocking protein solutions, detergent type(s) and concentrations, 

25 molecules such as polymers which increase the relative concentration of the polynucleotides, 
metal ion(s) and their concentration(s), chelators) and their concentrations, and other conditions 
known in the art. Less stringent, and/or more physiological, hybridization conditions are used 
where a labeled polynucleotide amplification product cycles on and off a substrate linked to a 
complementary probe polynucleotide during a real-time assay which is monitored during PCR 

50 amplification such as a molecular beacon assay. Such less stringent hybridization conditions can 
also comprise solution conditions effective for other aspects of the method, for example reverse 
transcription or PCR. 

As used herein, a "biological sample" refers to a sample of tissue, cells or fluid isolated 
from a subject, that commonly includes antibodies produced by the subject. Typical samples 
J5 include but are not limited to, blood, plasma, serum, fecal matter, urine, bone marrow, bile, 
spinal fluid, lymph fluid, samples of the skin, secretions of the skin, respiratory, intestinal, and 
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genitourinary tracts, tears, saliva, sputum, mucous, milk, blood cells, organs, tissues, biopsies 
(e.g. lung, liver, kidney) and also samples of in vitro cell culture constituents including but not 
limited to conditioned media resulting from the growth of cells and tissues in culture medium 
e.g. recombinant cells, and cell components. Other samples that may be used for diagnosis 

5 include stool samples and nasopharyngeal aspirates. 

The term "antibody" encompasses polyclonal and monoclonal antibody preparations, as 
well as preparations including hybrid antibodies, altered antibodies, chimeric antibodies and, 
humanized antibodies, as well as: hybrid (chimeric) antibody molecules (see, for example, 
Winter et aL (1991) Nature 349:293-299; and US Patent 4,816,567); F(ab f ) 2 and F(ab) 

10 fragments; Fv molecules (noncovalent heterodimers, see, for example, Inbar et aL (1972) Proc 
Natl Acad Sci USA 69:2659-2662; and Ehrlich et aL (1980) Biochem 19:4091-4096); 
single-chain Fv molecules (sFv) (see, e.g. , Huston et aL (1988) Proc Natl Acad Sci USA 
85:5879-5883); oligobodies; dimeric and trimeric antibody fragment constructs; minibodies (see, 
e.g, 9 Pack et aL (1992) Biochem 31:1579-1584uCumber et aL. (1992) J Immunology 

.5 149B: 120-126); humanized antibody molecules (see, e.g., Riechmann et aL (1988) Nature 
332:323-327; Verhoeyan et aL (1988) Science 239:1534-1536; and UK Patent Publication No. 
GB 2,276,169, published 21 September 1994); and, any functional fragments obtained from such 
molecules, wherein such fragments retain specific-binding properties of the parent antibody 
molecule. 

0 As used herein, the term "monoclonal antibody" refers to an antibody composition having 

a homogeneous antibody population. The term is not limited regarding the species or source of : 
the antibody, nor is it intended to be limited by the manner in which it is made. The term 
encompasses whole immunoglobulins. 

: Methods of making polyclonal and monoclonal antibodies are known in the art. Polyclonal 

5 antibodies are generated by immunizing a suitable animal, such as a mouse, rat, rabbit, sheep or • 
goat, with an antigen of interest In order to enhance immunogenicity, the antigen can be linked 
to a carrier prior to immunization. Suitable carriers are typically large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or 

3 liposomes), and inactive virus particles. Such carriers are well known to those of ordinary skill in 
the art Furthermore, the antigen may be conjugated to a bacterial toxoid, such as toxoid from 
diphtheria, tetanus, cholera, etc., in order to enhance the immunogenicity thereof. 

Rabbits, sheep and goats are preferred for the preparation of polyclonal sera when large 
volumes of sera are desired These animals are good design choices also because of the 

> availability of labeled anti-rabbit, anti-sheep and anti-goat antibodies. Immunization is generally 
performed by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as 
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Freund's complete adjuvant ("FCA"), and injecting the mixture or emulsion parenterally 
(generally subcutaneously or intramuscularly). Hie animal is generally boosted 2-6 weeks later 
with one or more injections of the antigen in saline, preferably using Round's incomplete 
adjuvant ("HA"). Antibodies may also be generated by in vitro immu nization, using methods 

5 known in the art Polyclonal antisera is then obtained from the immunized animal. 

Monoclonal antibodies are generally prepared using the method of Kohler & Milstein 
(1975) Nature 256:495^97, or a modification thereof, as described above. 
Nucleic acid detection methods 

There are many well known methods of amplifying targeted sequences, such as the 

10 polymerase chain reaction (PCR), reverse transcription PCR (RT-PCR), the ligase chain reaction 
(LCR), the strand displacement amplification (SDA), and the nucleic acid sequence-based 
amplification (NASB A), transcription-mediated amplification (TMA) to name a few. These 
methods are described generally in the following references: (PCR) US Patents 4,683,195, 
4,683,202, and 4,800,159; (RT-PCR) US patent 5310,652, 5,322,770; (LCR) EP Application 

5 No., 320,308 published Jun. 14, 1989; (SDA) US Pat Nos. 5,270,184, and 5,455,166 and 
"Empirical Aspects of Strand Displacement Amplification" by G. T. Walker in PCR Methods 
and Applications, 3(l):l-6 (1993), Cold Spring Harbor Laboratory Press; (TMA) US Patent No. 
5,399,491 , and (NASB A) "Nucleic Acid Sequence-Based Amplification (NASB A™)" by L. 
Malek et aL 9 Ch. 36 in Methods in Molecular Biology, Vol. 28: Protocols for Nucleic Acid 

0 Analysis by Nonradioactive Probes, 1994 Ed. P. G. Isaac, Humana Press, Inc., Totowa, N.J. PCR 
methods may include variations that permit quantitation of the target sequence, for example, by 
real time PCR analysis (e.g., as described in US patents 5,210,015, 5,487,972, 5,994,056, 
6,171,785 inter alia). (Each of the above references are hereby incorporated by reference). 

One embodiment of the method of the invention for detecting the presence of S ARS virus 

5 in a sample comprises providing a sample suspected of containing a S ARS virus nucleic acid 
target, amplifying a template sequence contained within said S ARS virus nucleic acid target by 
any known technique of nucleic acid amplification, including any of those mentioned herein, 
using the oligonucleotide primers described herein, particularly those primers comprising the kits 
described herein, and detecting the amplified template sequence, wherein the presence of the 

) amplified template sequence indicates the presence of S ARS virus in said sample. 

Amplification techniques generally involve the use of two primers. Where a target 
sequence is single-stranded, the techniques generally involve a preliminary step in which a 
complementary strand is made in order to give a double-stranded target* The two primers 
hybridize to different strands of the double-stranded target and are then extended The extended 

> products can serve as targets for further rounds of hybridization/extension. The net effect is to 
amplify a template sequence within the target, the 5* and 3* temini of the template being defined 
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by the locations of the two primers in the target As an alternative, if one or both of the primers 
contains a promoter sequence then the target can be amplified (by transcription) using a RNA 
polymerase (as in IMA). 

The present invention provides methods and kits for amplifying and/or detecting a template 
or target sequence in the SARSV viral nucleic acid. The invention provides a kit comprising 
primers for amplifying a template sequence contained within a SARSV nucleic acid target, the 
kit comprising a first primer and a second primer, wherein the first primer comprises a sequence 
substantially complementary to a portion of said template sequence and the second primer 
comprises a sequence substantially complementary to a portion of the complement of said 
template sequence, wherein the sequences within said primers which have substantial 
complementarity define the termini of the template sequence to be amplified. 

Kits of the invention may further comprise a probe which is substantially complementary 
to the template sequence and/or to its complement and which can hybridize thereto. This probe 
can be used in a hybridization technique to detect amplified template, or to isolate (Le. "capture) 
die amplified template or the original target nucleic acid. 

Kits of the invention may further comprise primers and/or probes for generating and 
detecting an internal standard, in order to aid quantitative measurements (e.g Rile et al 1997 
Biotechniques 23:34-36). 

Kits of the invention may further comprise a DNA polymerase, which will generally be a 
thermostable DNA polymerase where a non-isothermal amplification process is to be used. The 
kits may also comprise supplies of dNTPs, a magnesium salt (e.g. MgCl 2 ), buffer solutions, etc. 

Kits of the invention may comprise more than one pair of primers (e.g. for nested 
amplification), and one primer may be common to more than one primer pair. The kit may also 
comprise more than one probe. 

Oligomer Probes and Primers 

In connection with the nucleic acid detection methods of the present invention described 
above, oligomers having sequence similarity, or complementarity, to the SARSV genome are 
useful. The SARSV genome sequences mentioned herein may be used to produce probes and 
primers which can be used in assays for the detection of nucleic acids in test samples. The probes 
may be designed from conserved nucleotide regions of the polynucleotides of interest or from 
non-conserved nucleotide regions of the polynucleotide of interest The design of such probes for 
optimization in assays is within the skill of those of ordinary skill in the art. Generally, nucleic 
acid probes are developed from non-conserved or unique regions when maximum specificity is 
desired, and nucleic acid probes are developed from conserved regions when assaying for 
nucleotide regions that are closely related to, for example, different members of a multi-gene 
family or in related species like mouse and man. 
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Using as a basis the SARSV genome which can be found as described herein, and/or 
preferably conserved regions of the SARSV genome, and/or the particularly described primer 
and probe sequences as disclosed herein, oligomers of approximately 8 nucleotides or more can 
be prepared which hybridize with the positive strand(s) of SARSV RNA or its complement, as 

5 well as to SARSV cDNAs. These oligomers can serve as probes for the detection (including 
isolation and/or labeling) of polynucleotides which contain SARSV nucleotide sequences, and/or 
as primers for the transcription and/or replication of targeted SARSV sequences. The oligomers 
contain a targeting polynucleotide sequence, which is comprised of nucleotides which are 
complementary to a target SARSV nucleotide sequence; the sequence is of sufficient length and 

10 complementarity with the SARSV sequence to form a duplex which has sufficient stability for 
the purpose intended For example, if the purpose is the isolation, via immobilization, of an 
analyte containing a target SARSV sequence, the oligomers would contain a polynucleotide 
region which is of sufficient length and complementarity to the targeted SARSV sequence to 
afford sufficient duplex stability to immobilize the analyte on a solid surface, via its binding to 

15 the oligomers, under the isolation condition?. For example, also, if the oligomers are to serve as 
primers for the transcription and/or replication of target SARSV sequences in an analyte 
polynucleotide, the oligomers would contain a polynucleotide region of sufficient length and 
complementarity to the targeted SARSV sequence to allow the polymerizing agent to continue 
replication from the primers which are in stable duplex form with the target sequence, under the 

10 polymerizing conditions. For example, also, if the oligomers are to be used as label probes, or 
are to bind to multimers, the targeting polynucleotide region would be of sufficient length and 
complementarity to form stable hybrid duplex structures with the label probes and/or multimers 
to allow detection of the duplex. The oligomers may contain a minimum of about 4 contiguous 
nucleotides which are complementary to targeted SARSV sequence; usually the oligomers will 

15 contain a minimum of about 8 contiguous nucleotides which are complementary to the targeted 
SARSV sequence, and preferably will contain a minimum of about 14, 15, 16, 17, 18, 19, 20,21, 
22, 23, 24, 25, 26, 27, 28, 29, or 30 contiguous nucleotides and up to about 50, 75, 100, 200 
contiguous nucleotides or more, which are complementary to the targeted SARSV sequence. 
Typically, for use in the amplification based methods (for example, PCR, RT-PCR, TMA) 

0 oligomers will be used as primer sets such that one member of the primer set has sequence 

similarity or complementarity to a more conserved (among coronaviruses) portion of the SARSV 
genome and the other member of the primer set has sequence similarity or complementarity to a 
less conserved portion. The primer sets can be used to amplify the target region in ways that are 
well known in the art Typically, the 5' untranslated region (SUTR) and the 3' untranslated 

5 region (3TJTR) are among the most conserved regions. Figure 8 shows an alignment of the 
5TJTR of several coronaviruses. Figure 10 shows an alignment of the 3TJTR of several 
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coronaviruses. Figures 9 and 11 show the sequences of preferred primers for amplification of the 
5TJTR and 3TJTR, respectively. Other primers and probes can readily be designed based on the 
sequence alignments provided herein. 

The oligomer, however, need not consist only of the sequence which is complementary to 

5 the targeted SARSV sequence. It may contain in addition, nucleotide sequences (e.g.promoters) 
or other moieties which are suitable for the purposes for which the oligomers are used For 
example, if the oligomers are used as primers for die amplification of SARSV sequences via, for 
example, PCR, they may contain sequences which, when in duplex, form restriction enzyme sites 
which facilitate the cloning of the amplified sequences. For example, also, if the oligomers are to 

10 be used as "capture probes" in hybridization assays, they would contain in addition a binding 
partner which is coupled to the oligomer containing the nucleotide sequence which is 
complementary to the targeted SARSV sequence. Other types of moieities or sequences which 
are useful of which the oligomers may be comprised or coupled to, are those which are known in 
the art to be suitable for a variety of purposes, including the labeling of nucleotide probes. 

.5 Table 4 (SEQ ID NOS: 1021-6020) shows forward and reverse primers that are useful for 

nucleic acid amplification of SARSV for diagnostic and screening methods. 

Preferred primers and probes for S ARS nucleic acid detection for diagnostic and screening 
are SEQ ID NOS: 7332-7336 (forward primers), SEQ ID NOS: 7337-7341 (reverse primers) and 
SEQ ID NOS: 7342-7352 (probes). These primers and probes are useful for detection of 

0 sequences in the 3' UTR. 

Any of the above forward primers may be used in combination with any of the above 
reverse primers for amplification of SARSV nucleic acid. The amplified product may be 
detected (or captured) with any of the abovfe probes. Particularly preferred combinations of 
forward and reverse primers and the probes for detecting the amplified product include: Forward 

5 SEQ ID NO: 7332 with reverse SEQ ID NO: 7337, 7338, 7339 or 7341 and probe SEQ ID NO: 
7342; forward SEQ ID NO: 7333 or 7334 with reverse SEQ ID NO: 7340 and any of probes 
SEQ ID NO: 7343-7351; Forward SEQ ID NO: 7335 and reverse SEQ ID NO: 7340 or 7341 and 
any of probes SEQ ID NO: 7342-7352. Other combinations of forward and reverse primers and 
appropriate probes can readily be determined by those skilled in the art from the above 

D information. 

Additional preferred primers and probes for SARS nucleic acid detection for diagnostic 
and screening are SEQ ID NOS: 7353-7362 (forward primers), SEQ ID NOS: 7363-7373 
(reverse primers) and SEQ ID NOS: 7374-7385 (probes). The primers and probes are useful for 
detection of sequences in the 5* UTR. 
5 The above primers may be used in combination for amplification of SARSV nucleic acid 

as follows: any of forward primers SEQ ID NO: 7353-7356 with any of reverse primers SEQ ID 
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NO: 7363-7366, 7368 and the amplified product detected (or captured) with probes SEQ ID NO: 
7374; any of forward primers SEQ ID NO: 7357-7362 with any of reverse primers SEQ ID NO: 
7367, 7369-7373 and the amplified products detected (or captured) with any of probes SEQ ID 
NO: 7375-7385. Particularly preferred combinations of forward and reverse primers and probes 
are: Forward primers SEQ ID NO: 7353-7356 with any of reverse primers SEQ ID NO: 7363- 
7366 and probes SEQ ID NO: 7374; forward primers SEQ ID NO: 7357-7358 with reverse 
primers SEQ ID NO: 7367, 7369 and probes SEQ ID NO: 7375 or 7376; Forward primers SEQ 
ID NO: 7357-7359 with reverse primers SEQ ID NO: 7367, 7369 or 7370 and probe SEQ ID 
NO: 7375 or 7376. More preferred are combinations of SEQ ID NO: 7353 or 7354 with SEQ ID 
NO: 7363 or 7364 and probe SEQ ID NO: 7374. Other combinations of forward and reverse 
primers and appropriate probes can readily be determined by those skilled in the art from the 
above information. A particularly conserved octanucleotide sequence (SEQ ID NO: 7386) occurs 
in the 3TJTR of SARS (approximately 70-80 bases from the 3' end) and of several other 
Coronaviruses that may be particularly useful in identifying SARSV. Primers including in this 
region are preferably combined with reverse primers from regions of sequence that are more 
specific for SARS. 

In addition to the above, the intergenic sequence (IS) that is characteristic of Coronavirus 
has been identified in SARSV (see above). The IS minimaUy comprises the sequence ACGAAC 
(SEQ ID NO: 7293) which occurs upstream of each open reading frame (ORF) in the viral 
genome. The 5TJTR which includes the IS is spliced onto the 5' end of each viral mRNA at or 
adjacent to the site of the IS. Thus, primers comprising theK or its complement are useful for 
amplifying viral nucleic acids, including cDNA made from the viral RNAs. The invention thus 
comprises a set of primers in which one primer comprises ACGAAC (SEQ ID NO: 7293) or its 
complement (SEQ ID NO: 7387) and one primer comprises any appropriate sequence from the 
SARS genome, or a complementary sequence. Useful probes for detecting and/or capturing the 
viral RNAs or cDNA made from the viral RNAs may also comprise the IS sequence, or its 
complement, described above. 

One set of primers for amplification of SARS sequences, particularly by RT-PCR, uses 
SEQ ID NOs 6562, 6563, 6564 and 6565. Of these, 6562 & 6564 are sense primers and 6563 & 
6565 are antisense primers. Primers SEQ ID NOS: 6562 & 6565 may be used in a first 
amplification, with a second nested amplification being performed using primers SEQ ID NOS: 
6563 & 6564. In some embodiments of the invention, these four primers are excluded 

One kit for amplification and detection of SARS sequences, particularly by RT-PCR, uses 
SEQ ID NOs 6567 & 6568 as primers, and SEQ ID NO 6566 as a probe (typically labeled e.g. 
with TAMRA and/or FAM) for the amplified sequence. In some emlrodiments of the invention, 
these primers and probe are excluded 
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One kit for amplification and detection of SARS sequences, particularly by RT-PCR, uses 
SEQ ID NOs 7395 & 6568 as primers, and SEQ ID NO 6566 as a probe (typically labeled e.g. 
with TAMRA and/or FAM) for the amplified sequence. In some embodiments of the invention, 
these primers and probe are excluded 
5 One kit for amplification of SARS sequences, particularly the nucleocapsid gene, uses 

SEQ ID NOs 6560 & 6561 as primers. In some embodiments of the invention, these primers are 
excluded 

One kit for amplification of SARS sequences uses SEQ ID NOs 6496, 6497, 6562, 6563, 
6564 & 6565 as primers. In some embodiments of the invention, these primers are excluded 
10 One kit for amplification of SARS sequences uses SEQ ID NOs 6562, 6563, 6564 & 6565 

as primers. In some embodiments of the invention, these primers are excluded 

One kit for amplification of SARS sequences uses SEQ ID NOs 6500, 6501, 6502 & 6503 
as primers. In some embodiments of the invention, these primers are excluded 

One kit for amplification of SARS sequences uses SEQ ID NOs 6496, 6497, 6500, 6501,. 
15 6502, 6503, 6562, 6563, 6564 & 6565 as primers. In some embodiments of the invention, these 
primers are excluded 

One kit for amplification and detection of SARS sequences, particularly by realtime (e.g. - 
TaqMan™) PCR, uses SEQ ID NOs 6567 & 6568 as primers, and SEQ ID NO 6566 as a probe 
(typically labeled e.g. with TAMRA and/or FAM) for the amplified sequence. In some 
iO embodiments of the invention, these primers and probe are excluded 

One kit for amplification and detection of SARS sequences, particularly by realtime (e.g. • 
TaqMan™) PCR, uses SEQ ID NOs 7395 & 6568 as primers, and SEQ ID NO 6566 as a probe • 
(typically labeled e.g ; with TAMRA and/or FAM) for the amplified sequence. In some 
embodiments of the invention, these primers and probe are excluded 
15 One kit for amplification and detection of SARS sequences uses SEQ ID NOs 6562, 6565 

and 6568 as primers, and SEQ ID NOs 7396 and 7397 as probes (typically labeled e.g. with 
TAMRA and/or FAM) for the amplified sequence. In some embodiments of the invention, these 
primers and probe are excluded 

One kit for amplification and detection of SARS sequences uses an oligonucleotide 
10 comprising SEQ ID NO: 9780 as a forward primer, an oligonucleotide comprising SEQ ID NO: 
9781 as a reverse primer, and an oligonucleotide comprising SEQ ID NO: 9782 as a probe. 

Preferred sequences for use with RT-PCR and LightCycler analysis include SEQ ID NOs 
6562, 6568, 6565, 7396 & 7397. In some embodiments of the invention, these primers and probe 
are excluded 

•5 The preparation of the oligomers is by means known in the art, including, for example, by 

methods which include excision, transcription, or chemical synthesis. The target sequences 
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and/or regions of the genome which are selected to which the targeting polynucleotides of the 
oligomers are complementary depend upon the purpose. For example, if the goal is to screen for 
the presence of SARSV in biological samples. (e.g. blood, respiratory material, liver, lung), the 
preferred oligomers would be used as probes and/or primers, and would hybridize to conserved 
regions of the SARSV genome. Some of the conserved regions of the SARSV genome to which 
the oligomers may bind are described herein, for example, 5XJTR and 3TJTR. 

In the basic nucleic acid hybridization assay, single-stranded analyte nucleic acid (either 
DNA or RNA) is hybridized to a nucleic acid probe, and resulting duplexes are detected. The 
probes for SARSV polynucleotides (natural or derived) are a length which allows the detection 
of unique viral sequences by hybridization. While 6-8 nucleotides may be a workable length, 
sequences of 10-12 nucleotides are preferred, and about 13, 14, 15, 16, 17, 18, 19, 20, or 21 or 
more nucleotides or more appears optimal. Preferably, these sequences will derive from regions 
which lack heterogeneity. These probes can be prepared using routine methods, including 
automated oligonucleotide synthetic methods. Among useful probes, for example, are those 
derived from less conserved regions of the SARSV genome. Regions of the genome that are 
typically less conserved can be readily ascertained from the sequence alignments provided 
herein, as well as by any other well known techniques. A complement to any unique portion of • 
the SARSV genome will be satisfactory. For use as probes, complete complementarity is 
desirable, though it may be unnecessary as the length of the fragment is increased. 

For use of such probes as agents to detect the presence of SARSV polynucleotides (for 
example in screening for contaminated blood or for diagnosing infected individuals), the 
biological sample to be analyzed, such as, without limitation, blood, serum, lung, liver; mucous,- 
kidney, saliva, or sputum, may be treated, if desired, to extract the nucleic acids contained 
therein. The resulting nucleic acid from the sample may be subjected to gel electrophoresis or 
other size separation techniques; alternatively, the nucleic acid sample may be dot blotted 
without size separation. In order to form hybrid duplexes with the targeting sequence of the 
probe, the targeted region of the analyte nucleic acid must be in single stranded form. Where the 
sequence is naturally present in single stranded form, denaturation will not be required. 
However, where the sequence is present in double stranded form, the sequence will be denatured. 
Denaturation can be carried out by various techniques known in the art Subsequent to 
denaturation, the analyte nucleic acid and probe are incubated under conditions which promote 
stable hybrid formation of the target sequence in the probe with the putative targeted sequence in 
the analyte, and the resulting duplexes containing the probe(s) are detected. 

Detection of the resulting duplex, if any, is usually accomplished by the use of labeled 
probes; alternatively, the probe may be unlabeled, but may be detectable by specific binding with 
a ligand which is labeled, either directly or mdirectly. Suitable labels, and methods for labeling 
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probes and ligands are known in the art, and include, for example, radioactive labels which may 
be incorporated by known methods (e.g., nick translation or kinasing), biotin, fluorescent groups, 
chemiluminescent groups (e.g., dioxetanes, particularly triggered dioxetanes), enzymes, 
antibodies, and the like. 

5 The region of the probes which are used to bind to the analyte can be made completely 

complementary to the SARSV genome. Therefore, usually high stringency conditions are 
desirable in order to prevent false positives. However, conditions of high stringency should only 
be used if the probes are complementary to regions of the viral genome which lack 
heterogeneity. The stringency of hybridization is determined by a number of factors during 
10 hybridization and during the washing procedure, including temperature, ionic strength, length of 
time, and concentration of formamide. These factors are outlined in, for example, Maniatis T. 
(1982). 

Variations of this basic scheme which are known in the art, including those which facilitate 
separation of the duplexes to be detected from extraneous materials and/or which amplify the 

15 signal from the labeled moiety, may also be used A number of these variations are reviewed in, . 
for example: Matthews & Kricka (1988), Analytical Biochemistry 169: 1 ; Landegren et al. 
(1988), Science 242:229; and Mittlin (1989), Clinical Chem. 35:1819. These and the following 
publications ; describing assay formats are hereby incorporated by reference herein. Probes 
suitable for detecting SARSV in these assays are comprised of sequences which hybridize with 

JO target SARSV polynucleotide sequences to form duplexes with the analyte strand, wherein the 
duplexes are of sufficient stability for detection in the specified assay system. 

A suitable variation is, for example, one which is described in US Pat. No. 4,868,105, 
issued Sep. 9, 1989, and in EPO Publication No. 225,807 (published Jun. 16, 1987). These 
publications describe a solution phase nucleic acid hybridization assay in which the analyte 

15 nucleic acid is hybridized to a labeling probe set and to a capturing probe set The probe-anal yte 
complex is coupled by hybridization with a solid-supported capture probe that is complementary 
to the capture probe set This permits the analyte nucleic acid to be removed from solution as a 
solid phase complex. Having the analyte in the form of a solid phase complex facilitates 
subsequent separation steps in the assay. The labeling probe set is complementary to a labeled 

10 probe that is bound through hybridization to the solid phase/analyte complex. 

The polymerase chain reaction (PCR) is a technique for amplifying a desired nucleic acid 
sequence (target) contained in a nucleic acid or mixture thereof. In PCR, a pair of primers are 
employed in excess to hybridize to the complementary strands of the target nucleic acid. The 
primers are each extended by a polymerase using the target nucleic acid as a template. The 

»5 extension products become target sequences themselves, following dissociation from the original 
target strand New primers then are hybridized and extended by a polymerase, and the cycle is 
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repeated to geometrically increase the number of target sequence molecules. PGR is disclosed in 
US Pat Nos. 4,683,195 and 4,683,202, which are incorporated herein by reference. 

The Ligase Chain Reaction (LCR) is an alternate method for nucleic acid amplification. In 
LCR, probe pairs are used which include two primary (first and second) and two secondary (third 

5 and fourth) probes, all of which are employed in molar excess to target The first probe 

hybridizes to a first segment of the target strand, and the second probe hybridizes to a second 
segment of the target strand, the first and second segments being contiguous so that the primary 
probes abut one another in 5' phosphate-3 f hydroxyl relationship, and so that a ligase can 
covalently fuse or ligate the two probes into a fused product In addition, a third (secondary) 

10 probe can hybridize to a portion of the first probe and a fourth (secondary) probe can hybridize 
to a portion of the second probe in a similar abutting fashion. Of course, if the target is initially 
double stranded, the secondary probes also will hybridize to the target complement in the first 
instance. Once the ligated strand of primary probes is separated from the target strand, it will : 
hybridize with the third and fourth probes which can be ligated to form a complementary, 

.5 secondary ligated product It is important to realize that the ligated products are functionally 
equivalent to either the target or its complement By repeated cycles of hybridization and 
ligation, amplification of the target sequence is achieved This technique is described more 
coriipletely in EP-A-320 308 to K. Backman published Jun. 16, 1989 and EP-A-0439182 to K. 
Backman et a/., published Jul. 31, 1991, both of which are incorporated herein by reference. 

0 For amplification of mRNAs, it is within the scope of the present invention to reverse 

transcribe mRNA into cDNA followed by polymerase chain reaction (RT-PCR); or, to use a 
single enzyme for both steps as described in US Pat No. 5,322,770, which is incorporated herein 
by reference; or reverse transcribe mRNA into cDNA followed by asymmetric gap ligase chain 
reaction (RT-AGLCR) as described by R. L. Marshall et aL , PGR Methods and Applications 

5 4:80-84 (1994), which also is incorporated herein by reference. 

TMA is described in detail in, e.g. y US Patent No. 5399,491, the disclosure of which is 
incorporated herein by reference in its entirety. In one example of a typical assay, an isolated 
nucleic acid sample, suspected of containing a SARSV target sequence, is mixed with a buffer 
concentrate containing the buffer, salts, magnesium, nucleotide triphosphates, primers, 

3 dithiothreitol, and spermidine. Hie reaction is optionally incubated at about 100°C for 
approximately two minutes to denature any secondary structure. After cooling to room 
temperature, reverse transcriptase, RNA polymerase, and RNAse H are added and the mixture is 
incubated for two to four hours at 37°C. The reaction can then be assayed by denaturing the 
product, adding a probe solution, incubating 20 minutes at 60°C, adding a solution to selectively 

5 hydrolyze the unhybridized probe, incubating the reaction six minutes at 60°C, and measuring 
the remaining chemiluminescence in a luminometer. 
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Generally, TMA includes .the following steps: (a) isolating nucleic acid, including RNA, 
from the biological sample of interest suspected of being infected with SARSV; and (b) 
combining into a reaction mixture (i) the isolated nucleic acid, (ii) first and second 
oligonucleotide primers, the first primer having a complexing sequence sufficiently 
5 complementary to the 3' terminal portion of an RNA target sequence, if present (for example the 
(+) strand), to complex therewith, and the second primer having a complexing sequence 
sufficiently complementary to the 3' terminal portion of the target sequence of its complement 
(for example, the (-) strand) to complex therewith, wherein the first oligonucleotide further 
comprises a sequence 5' to the complexing sequence which includes a promoter, (iii) a reverse 

10 transcriptase or RNA and DNA dependent DNA polymerases, (iv) an enzyme activity which 

selectively degrades the RNA strand of an RNA-DNA complex (such as an RNAse H) and (v) an 
RNA polymerase which recognizes the promoter. 

The components of the reaction mixture may be combined stepwise or at once. The 
reaction mixture is incubated under conditions whereby an oligonucleotide/target sequence is 

15 formed, including DNA priming and nucleic acid synthesizing conditions (including 
ribonucleotide triphosphates and deoxyribonucleotide triphosphates) for a period of time 
sufficient to provide multiple copies of the target sequence. The reaction advantageously takes 
place under conditions suitable for maintaining the stability of reaction components such as the 
component enzymes and without requiring modification or manipulation of reaction conditions 

20 during the course of the amplification reaction. Accordingly, the reaction may take place under 
conditions that are substantially isothermal and include substantially constant ionic strength and 
pH. The reaction conveniently does not require a denaturation step to separate the RNA-DNA 
complex produced by the first DNA extension reaction. 

Suitable DNA polymerases include reverse transcriptases, such as avian myeloblastosis 

15 virus (AMV) reverse transcriptase (available from, e.g., Seikagaku America, Inc.) and Moloney 
murine leukemia virus (MMLV) reverse transcriptase (available from, e.g. , Bethesda Research - 
Laboratories). 

Promoters or promoter sequences suitable for incorporation in the primers are nucleic acid 
sequences (either naturally occurring, produced synthetically or a product of a restriction digest) 

(0 that are specifically recognized by an RNA polymerase that recognizes and binds to that 

sequence and initiates the process of transcription whereby RNA transcripts are produced. The 
sequence may optionally include nucleotide bases extending beyond the actual recognition site 
for the RNA polymerase which may impart added stability or susceptibility to degradation 
processes or increased transcription efficiency . Examples of useful promoters include those 

►5 which are recognized by certain bacteriophage polymerases such as those from bacteriophage 
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T3, T7 or SP6, or a promoter from K coli. These RNA polymerases are readily available from 
commercial sources, such as New England Biolabs and Epicentre. 

Some of the reverse transcriptases suitable for use in the methods herein have an RNAse H 
activity, such as AMV reverse transcriptase. It may, however, be preferable to add exogenous 
5 RNAse H, such as K coli RNAse H, even when AMV reverse transcriptase is used RNAse His 
readily available from, e.g., Bethesda Research Laboratories, 

Hie RNA transcripts produced by these methods may serve as templates to produce 
additional copies of the target sequence through the above-described mechanisms. The system is 
autocatalytic and amplification occurs autocatalytically without the need for repeatedly 
10 modifying or changing reaction conditions such as temperature, pH, ionic strength or the like. 

Detection may be done using a wide variety of methods, including direct sequencing, 
hybridization with sequence-specific oligomers, gel electrophoresis and mass spectrometry, 
these methods can use heterogeneous or homogeneous formats, isotopic or nonisotopic labels, as 
well as no labels at all. 

1 5 Suitable labeling moieties for attachment to primers and/or to probes used in methods of 

the invention include, but are not limited to: 5-FAM (also called 5-carboxyfluorescein; also 
called Spiro(isobenzofuran~l(3H), 9 -(9H)xanthene)-5-caiboxylic arid,3\6-dihydroxy-3-oxo-6- 
carboxyfluorescein); 5-Hexachloro-Fluorescein ([4,7,2\4\5\7 f -hexacMoro-(3\6 f - 
dipivaloylfluoresceinyl>6-carboxylic acid ]); 6-HexacMoro-Huorescein ([4,7,2 , ,4\5 , ,7- 

20 hexacUoro^3\6 , -dipivaloylfluoresceinyl)-5-carboxylic acid ]); 5-Tetrachloro-Fluorescein 
([4,7^^7 r -tetI^UoroK3^6 , -dipivaloylfluoresceinyl>•5- carboxylic acid ]); 6-Tetrachloro- 
Huorescein ([4,7^\7 , -tetracWoro<3\6 , ^pivaloylfluoresceinyl)-6- carboxylic acid ]); 
tetramethylrhodamines (TAMRA), including (i) 5-TAMRA (5-carboxytetramethylrfiodamine; 
Xanthylium, 9<2,4-dicarboxyphenyl)-3,6- bis(dimethylamino) and (ii) 6-TAMRA (6- 

15 carboxytetramethylrhodamine; Xanthylium, 9-(2,5-dicarboxyphenyl)-3,6- bis(dimethylamino); 
EDANS (5K(2-aminoethyl)amino)naphthalene- 1-sulfonic acid) ; 1,5-IAEDANS (5-((((2- 
iodoacetyl)amino)ethyl) amino)naphthalene-l-sulfonic acid); DABCYL (4-((4- 
(dimethylamino)phenyl) azo)benzoic aicid); Cy5 (bidodicarbocyanine-5); Cy3 
(Indodicarbocyanine-3); and BODIPY™ EL (4,4-difluon>-5,7Klimethyl-4-bora-3a,4a-diaza"S- 

10 mdacene-3-propionic acid). Labelling of probes with both FAM (e.g. at 5 1 ) and TAMRA (e.g. at 
3 1 ) is preferred 

Nucleic acids of the invention may be used in solution or may be bound to a solid matrix or 
support e.g. in the format of a DNA array, 

As is readily apparent, design of the assays described herein are subject to a great deal of 
•5 variation, and many formats are known in the art The above descriptions are merely provided as 
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guidance and one of skill in the art can readily modify the described protocols, using techniques 
well known in the art. 

One 302nt amplicon of the SARS virus is known as 'T3NI-1" (SEQ ID NO: 9927). It was 
sequenced at the Bemhard Nocht Institute, Hamburg, Germany. In April 2003 the BNI-1 

5 sequence was published on the WHO website (http://www.who.Wcsr/sars^rbners/en^ and in 

Dorsten et a£, "Identification of a Novel Coronavirus in Patients with Severe Acute Respiratory 
Syndrome", New England Journal of Medicine, published online at http://www.nejm.org. Both 
references are incorporated herein by reference in their entirety. Some embodiments of the 
invention do not encompass a nucleic acid consisting of SEQ ID NO: 9927. Some other 

10 embodiments of the invention do not encompass a nucleic acid comprising SEQ ID NO: 9927. 
Some embodiments of the invention do not encompass a polypeptide consisting of any one of 
SEQ ID NO s : 9928 to 9959. Some other embodiments of the invention do not encompass a 
nucleic acid comprising any one of SEQ ID NO s : 9928 to 9959. Some embodiments of the 
invention are not subject to these exclusions. 

.5 Immunoassays 

The present invention utilizes various immunoassay techniques for identifying individuals 
exposed to SARSV and/or biological samples containing SARSV antigens or antibodies to 
SARSV. 

Immunoassay Formate 

0 The SARSV antigens may be employed in virtually any assay format that employs a 

known antigen to detect antibodies. A common feature of all of these assays is that the antigen is 
contacted with biological sample suspected of containing SARSV antibodies under conditions 
that permit the antigen to bind to any such antibody present in the component Such conditions 
will typically be physiologic temperature, pH and ionic strength using an excess of antigen. The 

5 incubation of the antigen with the specimen is followed by detection of immune complexes 
comprised of the antigen. Alternatively, anti-SARS V antibodies may be employed to detect the 
presence of SARSV antigens in a biological sample. Combination antigen/antibody assays are 
also contemplated; for example, as described for HCV detection in US patent 6,63038. 

Design of the immunoassays is subject to a great deal of variation, and many formats are 

3 known in the art. Protocols may, for example, use solid supports, or immunoprecipitation. Most 
assays involve the use of labeled antibody or polypeptide; the labels may be, for example, 
enzymatic, fluorescent, chemiluminescent, radioactive, or dye molecules. Assays which amplify 
the signals from the immune complex are also known; examples of which are assays which 
utilize biotin and avidin, and enzyme-labeled and mediated immunoassays, such as ELB A 

) assays. 
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Hie immunoassay may be, without limitation, in a heterogeneous or in a homogeneous 
format, and of a standard or competitive type. In a heterogeneous format, the polypeptide is 
typically bound to a solid matrix or support to facilitate separation of the sample from the 
polypeptide after incubation. Examples of solid supports that can be used are nitrocellulose (e.g., 

5 in membrane or microtiter well form), polyvinyl chloride (e.g., in sheets or microliter wells), 
polystyrene latex (e.g., in beads or microtiter plates, polyvinylidine fluoride, diazotized paper, 
nylon membranes, microchips, high or low density biochips, recombinant immunoassays 
(RIB A), micro fluidity devices, micromagnetic beads, activated beads, and Protein A beads. For 
example, Dynatech Immunlon or Immunlon 2 microtiter plates or 0.23 inch polystyrene beads 

10 (Precision Plastic Ball) can be used in the heterogeneous format The solid support containing 
the antigenic polypeptides is typically washed after separating it from the test sample, and prior 
to detection of bound antibodies. Both standard and competitive formats are known in the art 

In a homogenous format, the test sample is incubated with the combination of antigens in 
solution. For example, it may be under conditions that will precipitate any antigen-antibody 

15 complexes which are formed Both standard and competitive formats for these assays are known 
in.theart 

In a standard format, the amount of SARSV antibodies in the antibody-antigen complexes 
is directly monitored. This may be accomplished by determining whether labeled anti- 
xenogeneic (e.g. 7 anti-human) antibodies which recognize an epitope on anti-SARSV antibodies 
10 will bind due to complex formation. In a competitive format, the amount of SARSV ahtibodies 
in the sample is deduced by monitoring the competitive effect on the binding of a known amount 
of labeled antibody (or other competing ligand) in the complex. 

Complexes formed comprising anti-SARSV antibody (or in the case of competitive assays, 
the amount of competing antibody) are detected by any of a number of known techniques, 
55 depending on the format. For example, unlabeled SARSV antibodies in the complex may be 
detected using a conjugate of antixenogeneic Ig complexed with a label, (e.g. 9 an enzyme label). 

In an immunoprecipitation or agglutination assay format the reaction between the SARSV 
antigens and the antibody forms a network that precipitates from the solution or suspension and 
forms a visible layer or film of precipitate. If no anti-SARSV antibody is present in the test 
►0 specimen, no visible precipitate is formed. 

There are at least three specific types of particle agglutination (PA) assays. These assays 
are used for the detection of antibodies to various antigens when coated to a support One type of 
this assay is the hemagglutination assay using red blood cells (RBCs) that are sensitized by 
passively adsorbing antigen (or antibody) to the RBC. The addition of specific antigen antibodies 
5 present in the body component, if any, causes the RBCs coated with the purified antigen to 
agglutinate. 
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To eliminate potential non-specific reactions in the hemagglutination assay, two artificial 
earners may be used instead of RBC in the PA. The most common of these axe latex particles. 
However, gelatin particles may also be used. The assays utilizing either of these camera are 
based on passive agglutination of the particles coated with purified antigens. 
5 The SARSV antigens will typically be packaged in the form of a kit for use in these 

immunoassays. The kit will normally contain in separate containers the native SARSV antigen, 
control antibody formulations (positive and/or negative), labeled antibody when the assay format 
requires same and signal generating reagents (e.g., enzyme substrate) if the label does not 
generate a signal directly. Hie native SARSV antigen may be already bound to a solid matrix or 

10 separate with reagents for binding it to the matrix. Instructions (e.g., written, tape, CD-ROM, 
etc.) for carrying out the assay usually will be included in the kit 

Immunoassays that utilize the native SARSV antigen are additionally useful in screening 
blood for the preparation of a supply from which potentially infective SARSV is lacking. The 
method for the preparation of the blood supply comprises the following steps. Reacting a body 

15 component, preferably blood or a blood component, from the individual donating blood with 
native SARSV antigen to allow an immunological reaction between SARSV antibodies, if any, 
and the SARSV antigen. Detecting whether anti-SARS V antibody-SARSV antigen complexes 
are formed as a result of the reacting. Blood contributed to the blood supply is from donors that 
do not exhibit antibodies to the native SARSV antigens. 

£0 Production of Antibodies 

As explained above, the assay may utilize various antibodies which may be bound to a 
solid support, and that detect antigen or antigen/antibody complexes formed when SARSV 
infection is piresent in the sample. These antibodies may be polyclonal or monoclonal antibody 
preparations, monospecific antisera, human antibodies, or may be hybrid or chimeric antibodies, 

IS such as humanized antibodies, altered antibodies, F(ab*)2 fragments, F(ab) fragments, Fv 
fragments, single-domain antibodies, dimeric or trimeric antibody fragment constructs, 
minibodies, or functional fragments thereof which bind to the antigen in question. 

Antibodies are produced using techniques well known to those of skill in the art and 
disclosed in, for example, US Pat Nos. 4,011,308; 4,722,890; 4,016,043; 3,876,504; 3,770,380; 

0 and 4,372,745. For example, polyclonal antibodies are generated by immu nizing a suitable 
animal, such as a mouse, rat, rabbit, sheep or goat, with an antigen of interest. In order to 
enhance immunogenicity, the antigen can be linked to a carrier prior to immunization. Such 
carriers are well known to those of ordinary skill in the art Immunization is generally performed 
by mixing or emulsifying the antigen in saline, preferably in an adjuvant such as Freund's 

5 complete adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously 
or intramuscularly). The animal is generally boosted 2-6 weeks later with one or more injections 
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of the antigen in saline, preferably using Freund's incomplete adjuvant Antibodies may also be 
generated by in vitro immunization, using methods known in the art Polyclonal antiserum is 
then obtained from the immunized animal. 

Monoclonal antibodies are generally prepared using the method of Kohler & Milstein 
(1975) Nature 256:495-497, or a modification thereof, as described above. 

As explained above, antibody fragments which retain the ability to recognize the antigen of 
interest, will also find use in the subject immunoassays. A number of antibody fragments are 
known in the art which comprise antigen-binding sites capable of exhibiting immunological 
binding properties of an intact antibody molecule. For example, functional antibody fragments 
can be produced by cleaving a constant region, not responsible for antigen binding, from the 
antibody molecule, using e.g., pepsin, to produce Ffab^ fragments. These fragments will contain 
two antigen binding sites, but lack a portion of the constant region from each of the heavy 
chains. Similarly, if desired, Fab fragments, comprising a single antigen binding site, can be 
produced, e.g., by digestion of polyclonal or monoclonal antibodies with papain. Functional 
fragments, including only the variable regions of the heavy and light chains, can also be 
produced, using standard techniques such as recombinant production or preferential proteolytic 
cleavage of immunoglobulin molecules. These fragments are known as Fv. See, e.g. , Libar et al. 
(1972) Proc. Nat Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) Biochem 15:2706- 
2710; andEhrlich etal. (1980) Biochem 19:4091-4096. 

A single-chain Fv ("sFv" or "scFv") polypeptide is a covalently linked VH-V L heterodimer 
which is expressed from a gene fusion including Vh- and V L -encoding genes linked by a peptide: 
encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. USA 85:5879-5883. A number of 
methods have been described to discern and develop chemical structures (linkers) for converting 
the naturally aggregated, but chemically separated, light and heavy polypeptide chains from an 
antibody V region into an sFv molecule which will fold into a three dimensional structure 
substantially similar to the structure of an antigen-binding site. See, e.g., US Pat Nos. 5,091,513, 
5,132,405 and 4,946,778. The sFv molecules may be produced using methods described in the 
art See, e.g., Huston et al (1988) Proc. Nat Acad. Sci. USA 85:5879-5883; US Pat Nos. 
5,091,513, 5,132,405 and 4,946,778. Design criteria include determining the appropriate length 
to span the distance between the C-terminus of one chain and the N-terminus of the other, 
wherein the linker is generally formed from small hydrophilic amino acid residues that do not 
tend to coil or form secondary structures. Such methods have been described in the art See, e.g., 
US Pat Nos. 5,091,513, 5,132,405 and 4,946,778. Suitable linkers generally comprise 
polypeptide chains of alternating sets of glycine and serine residues, and may include glutamic 
acid and lysine residues inserted to enhance solubility. 
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"Mini-antibodies" or "minibodies" will also find use with die present invention. 
Minibodies are sFv polypeptide chains which include oligomerization domains at their C- 
tennini, separated from the sFv by a hinge region. Pack et aL (1992) Biochem 31:1579-1584. 
Ihe oligomerization domain comprises self-associating a-helices, e.g., leucine zippers, that can 
5 be further stabilized by additional disulfide bonds. Ihe oligomerization domain is designed to be 
compatible with vectorial folding across a membrane, a process thought to facilitate in vivo 
folding of the polypeptide into a functional binding protein. Generally, minibodies are produced 
using recombinant methods well known in the art See, e.g., Pack et aL (1992) Biochem 
31:1579-1584; Cumber etaL (1992) J. Immunology 149B: 120-126. 

10 Production of SARS Antigens 

The S ARS V antigens used in the present invention are generally produced recombinantly. 
Thus, polynucleotides encoding SARS V antigens for use with the present invention can be made 

using standard techniques of molecular biology. For example, polynucleotide sequences coding 

« 

for the above-described molecules can be obtained using recombinant methods, such as by 
.5 screening cDNA and genomic libraries from cells expressing the gene, or by deriving the gene • 
from a vector known to include the same. Furthermore, the desired gene can be isolated directly 
firom viral nucleic acid molecules, using techniques described in the art, such as those described 
for HCV in Houghton et aL, US Pat No. 5,350,671. The gene encoding the antigen of interest 
can also be produced synthetically, rather than cloned. The molecules can be designed with 
O appropriate codons for the particular sequence (preferably optimum codons for the expression 
host of choice). The complete sequence is then assembled from overlapping oligonucleotides 
prepared by standard methods and assembled into a complete coding sequence. See, e.g. , Edge 
(1981) Nature 292:756; Nambair et aL (1984) Science 223:1299; and Jay et aL (1984) J. Biol 
Chem. 259:6311. 

5 Thus, particular nucleotide sequences can be obtained from vectors harboring the desired 

sequences or synthesized completely or in part using various oligonucleotide synthesis 
techniques known in the art, such as site-directed mutagenesis and polymerase chain reaction 
(PCR) techniques where appropriate. See, e.g., Sambrook, supra. In particular, one method of 
obtaining nucleotide sequences encoding the desired sequences is by annealing complementary 

) sets of overlapping synthetic oligonucleotides produced in a conventional, automated 
polynucleotide synthesizer, followed by ligation with an appropriate DNA ligase and 
amplification of the ligated nucleotide sequence via PCR. See, e.g., Jayaraman et aL (1991) 
Proc. Natl. Acad. Sci. USA 88:4084-4088. Additionally, oligonucleotide directed synthesis 
(Jones et al. (1986) Nature 54:75-82), oligonucleotide directed mutagenesis of pre-existing 

> nucleotide regions (Riechmann et al. (1988) Nature 332:323-327 and Verhoeyen et aL (1988) 
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Science 239:1534-1536), and enzymatic filling-in of gapped ohgonucleotides using T4DNA 
polymerase (Queen et at (1989) Proc. Natl Acad. Sci USA 86:10029-10033) can be used under 
the invention to provide molecules having altered or enhanced antigen-binding capabilities, 
and/or reduced immunogenicity. 

Once coding sequences have been prepared or isolated, such sequences can be cloned into 
any suitable vector or repKcon. Numerous cloning vectors are known to those of skill in the art, 
and the selection of an appropriate cloning vector is a matter of choice. Suitable vectors include, 
but are not limited to, plasmids, phages, transposes, cosmids, chromosomes (including artificial 
chromosomes, such as BACs or YACs) or viruses which are capable of replication when 
associated with the proper control elements. 

The coding sequence is then placed under the control of suitable control elements, 
depending on the system to be used for expression. Thus, the coding sequence can be placed 
under the control of a promoter, ribosome binding site (for bacterial expression) and, optionally, 
an operator, so that the DNA sequence of interest is transcribed into RNA by a suitable 
transforrhant. The coding sequence may or may not contain a signal peptide or leader sequence 
which can later be removed by the host in post-translational processing. See, e.g., US PaL Nos. 
4,431,739; 4^425,437; 4338397. 

In addition to control sequences, it may be desirable to add regulatory sequences which ' . 
allow for regulation of the expression of the sequences relative to the growth of the host cell. 
Regulatory sequences are known to those of skill in the art, and examples include those which 
cause the expression of a gene to be turned on or off in response to a chemical or physical 
stimulus, including the presence of a regulatory compound. Other types of regulatory elements 
may also be present in the vector. For example, enhancer elements may be used herein to 
increase expression levels of the constructs. Examples include the SV40 early gene enhancer 
(Dijkemaeroi (1985) 4:761), the enhancer/promoter derived from the long terminal 

repeat (LTR) of the Rous Sarcoma Virus (Gorman et at (1982) Proc. Natl. Acad. Sci. USA 
79:6777) and elements derived from human CMV (Boshart et at (1985) Cell 41:521), such as 
elements included in the CMV intron A sequence (US Pat No. 5,688,688). The expression 
cassette may further include an origin of replication for autonomous replication in a suitable host 
cell, one or more selectable markers, one or more restriction sites, a potential for high copy 
number and a strong promoter. 

An expression vector is constructed so that the particular coding sequence is located in the 
vector with the appropriate regulatory sequences, the positioning and orientation of the coding 
sequence with respect to the control sequences being such that the coding sequence is transcribed 
under the "control" of the control sequences (ie., RNA polymerase which binds to the DNA 
molecule at the control sequences transcribes the coding sequence). Modification of the 
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sequences encoding the molecule of interest may be desirable to achieve this end For example, 
in some cases it may be necessary to modify the sequence so that it can be attached to the control 
sequences in the appropriate orientation; Le., to maintain the reading frame. The control 
sequences and other regulatory sequences may be ligated to the coding sequence prior to 
5 insertion into a vector. Alternatively, the coding sequence can be cloned directly into an 

expression vector which already contains the control sequences and an appropriate restriction 
site. 

As explained above, it may also be desirable to produce mutants or analogs of the antigen 
of interest Methods for doing so are described in, e.g., Dasmahapatra et al., US Pat No. 

10 5,843,752 and Zhang etal.,US Pat No. 5,990,276. Mutants or analogs of SARSV proteins for 
use in the subject assays may be prepared by the deletion of a portion of the sequence encoding 
the polypeptide of interest, by insertion of a sequence, and/or by substitution of one or more 
nucleotides within the sequence. Techniques for modifying nucleotide sequences, such as site- 
directed mutagenesis, and the like, are well known to those skilled in the art. See, e.g., Sambrook 

.5 et al, supra; Kunkel, T. A. (1985) Proc. Natl Acad. ScL USA (1985) 82:448; Geisselsoder etaL 
(1987) BioTeclmiques 5:786; Zoller & Smith (1983) Methods Enzymol. 100:468; Dalbie- 
McFarland et aL (1982) Proc. Natl. Acad Sci USA 79:6409. 

The molecules can be expressed in a wide variety of systems, including insect, 
mammalian, bacterial, viral and yeast expression systems, all well known in the art 

0 For example, insect cell expression systems, such as baculovirus systems, are known to 

those of skill in the art and described in, e.g. , Summers & Smith, Texas Agricultural Experiment 
Station Bulletin No. 1555 (1987). Materials and methods for baculovirus/insect cell expression 
systems are commercially available in kit form from, inter alia, Invitrogen, San Diego Calif. 
("MaxBac" kit). Similarly, bacterial and mammalian cell expression systems are well known in 

5 the art and described in, e.g., Sambrook et al, supra. Yeast expression systems are also known in . 
the art and described in, e.g., Yeast Genetic Engineering (Barr et aL, eds., 1989) Butterworths, 
London. 

A number of appropriate host cells for use with the above systems are also known. For 
example, m a mmali an cell lines are known in the art and include immortalized cell lines available 

3 from the American Type Culture Collection (ATCC), such as, but not limited to, Chinese 

hamster ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) cells, monkey kidney cells 
(COS), human embryonic kidney cells, human hepatocellular carcinoma cells (e.g., Hep G2), 
Madin-Darby bovine kidney ("MDBK n ) cells, as well as others. Similarly, bacterial hosts such as 
Rcoli, Bacillus subtilis, and Streptococcus spp., will find use with the present expression 

5 constructs. Yeast hosts useful in the present invention include inter alia, Saccharomyces 

cerevisiae, Candida albicans, Candida maltosa, Hansenula polymorpha, Kluyveromyces fragilis, 
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Kluyveromyces lactis, Pichia guillerimondii, Pichia pastoris, Schizosaccharomyces pombe and 
Yarrowia lipolytics Insect cells for use with baculovirus expression vectors include, inter alia, 
Aedes aegypti, Autographa califormica, Bombyx mori, DrosophUa melanogaster, Spodoptera 
frugiperda, and Trichoplusia ni. 

Nucleic acid molecules comprising nucleotide sequences of interest can be stably 
integrated into a host cell genome or maintained on a stable episomal element in a suitable host 
cell using various gene delivery techniques well known in the art See, e.g., US Pat No. 
5399346. 

Depending on the expression system and host selected, the molecules arc produced by 
growing host cells transformed by an expression vector described above under conditions 
whereby the protein is expressed. The expressed protein is then isolated from the host cells and 
purified If the expression system secretes the protein into growth media, the product can be 
purified directly from the media. If it is not secreted, it can be isolated from cell lysates. The 
selection of the appropriate growth conditions and recovery methods are within the skill of the 
art. 

EXAMPLE 

For useful expression of SARS V antigens in Saccharomyces cerevisiae and Pichia 
pastoris, insect cells, and mammalian cells, the following domains are cloned into expression 
vectors as listed in the Table below. The nt sequence numbers are from the SARS V sequence of 
SEQIDNO:l. 

RNA polymerase la: SARS nt 250-13398 
RNA polymerase lb: SARS nt 13399-21470 

ORFns.envelope (homologous to ns2, hemagglutinin-esterase envelope glycoprotein, 
and spike glycoprotein): SARS nt 21477-25244 

Membrane: SARS nt 27849 - 28103 

Nucleocapsid: SARS nt 28105 - 29373 
A combination of PCR and synthetic oligos is used to create the above domains with restriction 
sites tailored to the following expression vectors: 



Restriction ends 


Vector 


Promoter 


HindHUSall 


pBS24.1 


ADH2/GAPDH 


EcoBJf&m 


pBS24.1 


ADH2/GAPDH/SOD fusion 


XbdVSall 


pA0815 


AOXI 


EcoSJ/BamHL 


pCMVkm2 


CMVp/EnhanceiyintronA 


EcdRI/Xmdl 


pCMVm 


CMVp/Bnhancer/IhtronA 


NheVSaH 


pBluBac4.5 


Polyhedrin 



Expression host 

AD3/Saccharomyces 

AD3/Saccharomyces 

GS115/Pichia pastoris 

HVK-293/Transient 

transfection 

CHO stable cell line 

Cell lines employed by Chiron 

include: Sf9, S£21,Tn5 
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IV. TREATMENT OF SARS INFECTION WITH RNAi 

RNA interference or "RNAi" is a term initially coined by Fire and co-workers to describe 
the observation that double-stranded RNA (dsRNA) can block gene expression when it is 
introduced into worms (Fire et aL, Nature 391, 806-811(1998)). RNAi most likely involves 

5 mRNA degradation, resulting in sequence-specific, post-trariscriptional gene silencing in many 
organisms. RNAi is a post-transcriptional process triggered by the introduction of double- 
stranded RNA which leads to gene silencing in a sequence-specific manner. RNAi has been 
reported to occur naturally in organisms as diverse as nematodes, trypanosmes, plants and fungi. 
It most likely serves to protect organisms from viruses, modulate transposon activity and 

10 eliminate aberrant transcription products. 

Hie first evidence that dsRNA could achieve efficient gene silencing through RNAi came 
from studies on the nematode Caenorhabditis elegans (P\ie et aL (1998) Nature, 391:806-811 
and US Patent No. 6,506,559). Later studies in the fruit fly Drosophila melanogaster 
demonstrated that RNAi is a two-step mechanism (Elbashir et aL (2001) Genes Dev., 15(2): 188- 

15 200). First, long dsRNAs are cleaved by an enzyme known as Dicer in 21-23 nucleotides (nt) 
fragments, called small interfering RNAs (siRNAs). Then, siRNAs associate with a ribonuclease 
complex (termed RISC for RNA Induced Silencing Complex) which target this complex to 
complementary mRNAs. RISC then cleaves the targeted mRNAs opposite the complementary 
siRNA, which makes the mRNA susceptible to other RNA degradation pathways. 

20 RNAi is the phenomenon where dsRNA corresponding to a targeted DNA or RNA 

sequence can suppress or silence gene expression. Even though dsRNA can mediate gene- 
specific interference in mammalian cells in some circumstances (Wianny & Zernicka-Goetz 

(2000) Nature Cell Biol 2:70-75; Svoboda et al (2000) Development 17:4147-4156) the use of 
RNAi in mammalian somatic cells is often limited due to the dsRNA triggering dsRNA- 

25 dependent protein kinase (PER) which in turn inactivates translation factor eIF2a and causes a 
generalized suppression of protein synthesis and often times apoptosis (Gil & Esteban (2000) 
Apoptosis 5: 107-1 14). 

Recently, gene-specific suppression using siRNA of approximately 21 or 22 base pairs in 
length, corresponding to targeted RNA or DNA sequences, were shown to disrupt the expression 
30 of these targeted sequences in mammalian cells (Elbashir, SM., et al., Nature 411: 494-498 

(2001) ). However, it is not clear that all RNA or DNA sequences of a mammalian cell's genome 
are susceptible to siRNA. It is also uncertain that every mammalian cell type possesses the 
necessary machinery for effecting gene-specific suppression using siRNA. Further, siRNA is of 
limited use for at least two reasons: the transient nature of the suppression effect seen in cells 

35 where the siRNA has been administered; and in some instances the necessity for chemical 

synthesis of siRNAs before their use (Tuschl T., Nature BiotechnoL, 20: 446-448 (2002)). Also 
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the instability of these short, synthetic RNAs makes it presents problems for any long torn use of 
these siRNAs a pharmaceutical. 

To overcome this limitation, the present invention provides a modified siRNA with 
increased stability against nuclease degradation while still maintaining its ability to inhibit viral 
5 replication via RNA interference. Such modification to the ribonucleotides in the siRNAs, adds 
a chemical group via chemical synthesis or in vitro transcription or longer modified RNAs can 
be prepared by either of these methods and cut into siRNAs using Dicer. 

Although other methods for gene-specific suppression have utilized chemically-modified 
nucleic acids, such as antisense and ribozyme technology, such modification destroys critical 
10 enzymatic activities necessary for the function of these technologies. In regard to antisense 
technology, modification of the ribonucleotides destroys RNaseH activity, whereas such 
. modification abolishes the catalytic activity of ribozymes. 

The present invention provides a double-stranded RNA (dsRNA) molecule modified for 
protection against nuclease degradation with a length from about 10 to about 30 nucleotides 
15 which is able to inactivate a virus in a mammalian cell. Hie invention also provides a method of 
inactivating a virus by administering modified small interfering RNAs (siRNAs) that are 
modified so that they are nuclease or RNase resistant and retain the biological activity of being 
able to inhibit viral replication by targeting a RNA sequence in a virus. 

, The invention is further directed to a method of making modified siRNAs that target a 
10 RNA sequence in a virus comprising preparing a modified-double stranded RNA (dsRNA) 
fragment containing at least one modified ribonucleotide in at least one strand that spans the 
genome of the virus; and cleaving the modified-dsRNA fragments with recombinant human 
Dicer resulting in more than one modified siRNA. 

The present invention provides a modified dsRNA molecule of from about 10 to about 30 
15 nucleotides which mediates targeted RNA interference in hepatic or SARS-infected cells. 

As used herein RNA interference, or RNAi, is used to mean sequence-specific, or gene 
specific, suppression of gene expression (protein synthesis), without causing a generalized 
suppression of protein synthesis in cells harboring the siRNA. The invention is not limited to a 
particular theory of the mechanism of action of RNAi. For example, RNAi may involve 
10 degradation of messenger RNA (mRNA) in an RNA-induced silencing complex (RISC), 

preventing translation of the transcribed mRNA, or it may involve the methylation of genomic 
DNA, shunting transcription of the gene. The lack of gene expression caused by RNAi may be 
transient, lasting a short period of time, or it may be stable, or permanent, lasting an indefinite 
period of time. 

'& The term RNA is meant as is recognized in the art Further, as used herein, RNA is used to 

mean double-stranded RNA (dsRNA) or single-stranded RNA (ssRNA) or a dsRNA with a 
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single-stranded overhang. dsRNAs within the meaning of the present invention includes short 
interfering RNA (siRNA), micro RNA (miRNA) and small hairpin RNA (shRNA), 
Additionally, RNA is also used to mean messenger RNA (mRNA), transfer RNA (tRNA) or 
ribosomal RNA (rRNA). 
5 The present invention is directed to small interfering RNA (siRNA) which have been 

chemically modified to confer increased stability against nuclease degradation yet these siRNAs 
are still able to bind to target RNAs, that may be present in a cells. In the case where the target 
RNA is a virus specific RNA, the modified siRNAs are able to bind to the virus specific RNAs 
and inactivate the virus. A modified siRNA of the present invention comprises a modified 

10 ribonucleotide, wherein the siRNA is resistant to enzymatic degradation, such as RNase 

degradation, and yet retains the ability to inhibit viral replication. The modified siRNA is more 
specifically modified at the 2' position of the ribose in the siRNA. The modification is at the T 
position of at least one ribonucleotide of said siRNA. Attachment of receptor-binding ligands to : 
siRNA molecules can be used to target the siRNA to a desired cell type. For example, 

15 attachment. of cholesterol at the 5'-end or 3'-end of the siRNA molecule, to give a cholesteryl • 
siRNA, can enhance targeting to hepatocytes. Other ligands for receptor mediated siRNA 
targeting to liver include HBV surface antigen, LDL, and others. 

More specifically, the siRNA is modified at at least one pyrimidine, at least one purine or a 
combination thereof. However, generally all pyrimidines, or all purines or a combination of all 

20 pyrimidines and all purines of the siRNA are modified. More preferably, the pyrimidines are 
modified and these pyrimidines are cytosine, a derivative of cytosine, uracil, a derivative of 
uracil or a combination thereof. It also is contemplated to modify the selected ribonucleotides in* 
at least one strand of the siRNA or the ribonucleotides in both strands of the siRNA are 
modified 

15 The nucleotides containing pyrimidine bases found in RNA (cytidine and uridine) can be 

chemically modified by adding any molecule that inhibits RNA degradation or breakdown to the 
2' position of the ribose molecule. The 2-modified pyrimidine nucleotide can be formed using a 
number of different methods. The 2' modification confers increased stability to the siRNA by - 
making the siRNA impervious or resistant to nuclease activity. Thus, the 2' modified siRNA has 

10 a longer serum half-life and is resistant to degradation compared to unmodified siRNA. The 
siRNA also may be modified completely or partially. 

Regarding chemical modification of siRNAs, a molecule from the halide chemical group is 
preferably added to the ribonucleotide of the siRNA. Within the halides, fluorine is the preferred 
molecule but other chemical molecules, in addition to fluoro-, such as methyl-, methoxyethyl- 

15 and propyl-modifications can also we made. But the preferred modications is fluoro- 

modification, such as a 2-fluoro-modication or a 2',2'-fluoro~modification.Thus, in a preferred 
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embodiment of the invention, the siRNA is modified by adding a fluorine molecule to the 2' 
caibon of the pyrimidine ribonucleotide. The siRNA may be fluorinated completely or partially. 
For example, only the cytosine nucleotides need be fluorinated. Alternatively, only the uracil 
nucleotide need be fluorinated but both uracil and cytosine can be fluorinated Furthermore, 
5 only one strand, either sense or antisense, of the siRNA can be fluorinated Even partial 2' 
fluorination the siRNA gives protection against nucleolytic degradation. Furthermore, it is 
important to note the 2' fluorinated siRNA is not toxic to cells, an unexpected result given that 
fluorine chemistry usually is toxic to living organisms. 

The siRNA of the present invention is designed to interact with a target nucleotide 

10 sequence. Most preferably this target nucleotide sequence is a disease producing agent or 

pathogen of which one wishes to inhibit gene expression. More preferably, this target nucleotide 
sequence is in a virus genome, and further this virus genome is from a RNA virus or a DNA 
virus is selected from the group consisting of hepatitis C virus (HCV), hepatitis A virus, hepatitis 
B virus, hepatitis D virus, hepatitis E virus, Ebola virus, influenza virus, rotavirus, reovirus, 

15 retrovirus, poliovirus, human papilloma virus (HFV), metapneumovirus and coronavixus. The 
most preferred virus is SARS virus. 

Modfied siRNA may be prepared in a number of ways, such as by chemical synthesis, T7 
polymerase transcription, or by treating modified long double stranded RNA (dsRNA) prepared 
by one of the two previous methods with Dicer enzyme. Dicer enzyme can be used to cleave 

iO dsRNA that is about 500 base pairs to about 1000 base pairs in size, to created mixed 
populations of dsRNA from about 21 to about 23 base pairs in length. Furthermore, an 
unexpected result of using the Dicer enzyme method is that Dicer enzyme will cleave modified 
strands of dsRNA, such as 2' fluorinated modified dsRNA. Before development of this method, 
it was previously thought that Dicer would not be able to cleave modified siRNA. The Dicer 

15 method can be carried out using the Dicer siRNA Generation Kit available from Gene Therapy 
Systems, San Diego, CA. 

As used herein, small interfering RNA (siRNA) is defined as double- or single-stranded 
RNA of from about 10 to about 30 nucleotides in length, more preferably 12-28 nucleotides, 
more preferably 15-25 nucleotides, even more preferably 19-23 nucleotides and most preferably 

JO 21-23 nucleotides. The length of a siRNA as used herein, is determined by the length of one of 
the strands of the RNA. For example, a siRNA that is described as 21 nucleotides long (a 21- 
mer) may comprise two opposite strands of RNA which anneal together for 19 contiguous base 
pairings. Hie two remaining nucleotides on one end of the molecule would not anneal to the 
opposite strand, thus creating an "overhang". The overhang can be at the 5' or the 3' end of the 

\5 dsRNA. Preferably, the overhang is at the 3' end of the RNA strand. The length of a double- 
stranded RNA where the two opposite strands are not the same length will be designated by the 
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longer of the two strands. For example, a dsRNA comprising one strand which is 21 nucleotides 
long and anneals to an opposite strand that is 20 nucleotides long, will be considered, as used 
herein, a 21-mer. 

Preferably, the siRNA of the present invention will comprise a 3' overhang of about 2 to 4 
5 bases. More preferably, the 3 1 overhang is 2 nucleotides long. Even more preferably, the 2 
nucleotides comprising the 3' overhang are uridine (U). 

In one embodiment, the invention provides an RNA molecule comprising a nucleotide 
sequence at least 80% identical to the nucleotide sequence of the target agent or virus. 
Preferably, the RNA molecule of the present invention is at least 90%, 95%, 96%, 97%, 98%, 
10 99% or 100% identical to the nucleotide sequence of the target agent or virus. 

As a practical matter, whether any particular nucleic acid molecule is at least 90%, 95%, 
96%, 97% 98%, 99% or 100% identical to the nucleotide sequence of the target agent or virus 
can be determined conventionally using known computer programs such as the Bestfit program 
(Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, 
15 University Research Park, 575 Science Drive, Madison, Wis. 53711). Bestfit uses the local 
homology algorithm of Smith & Waterman (Advances in Applied Mathematics 2:482-489 
(1981)) to find the best segment of homology between two sequences. When using Bestfit or any 
other sequence alignment program to determine whether a particular sequence is, for instance, 
95% identical to a reference sequence according to the present invention, the parameters are set, s 
10 of course, such that the percentage of identity is calculated over the full length of the reference 
nucleotide sequence and that gaps in homology of up to 5% of the total number of nucleotides in ; 
the reference sequence are allowed. 

The present invention provides a method of inactivating a target agent or preferably a virus 
in a. patient comprising administering to the patient a modified siRNA in an effective amount to 
15 inactivate the targeted agent or virus. RNA interference towards a targeted DNA segment in a 
cell can be achieved by administering a dsRNA molecule or siRNA to the cells, wherein the 
nucleotide sequence of the dsRNA molecule corresponds to the nucleotide sequence of the 
targeted DNA segment Preferably, the RNA molecule used to induce targeted RNAi is siRNA. 
Gene suppression, targeted suppression, sequence-specific suppression, targeted RNAi or 
0 sequence-specific RNAi are used interchangeably herein. Furthermore, sequence-specific 
suppression, as used herein, is determined by separately assaying the levels of the protein 
targeted for suppression in cells containing the siRNA (experimental cells) and in cells not 
containing the identical siRNA (control cells), and comparing the two values. Furthermore, the 
experimental and control cells must be derived from the same source and same animal. For 
5 example, the control and experimental cells can be, but are not limited to, normal human hepatic 
cells as cell culture in vitro, or they can derived from a hepatocellular carcinoma. Further, the 
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control and experimental cells used in determining the level or quantity of gene suppression must 
be assayed under similar, if not identical, conditions. 

As used herein the phrase "targeted DNA segment" is used to mean a DNA sequence 
encoding, in whole or in part, an mRNA for a targeted protein, including introns or exons, where 
5 suppression is desired. DNA segment can also mean a DNA sequence that normally regulates 
expression of the targeted protein, including but not limited to the promoter of the targeted 
protein. Furthermore, the DNA segment may or may not be a part of the cell's genome or it may 
be ex trachromo somal , such as plasmid DNA. 

Hie present invention is further directed to inactivating a virus in a patient comprising 

10 administering to a patient a modified siRNA in an effective amount to inactivate the virus. The . 
siRNA is preferably about 10 to about 30 nucleotides in length, more preferably 12-28 
nucleotides, more preferably 15-25 nucleotides, even more preferably 19-23 nucleotides and 
most preferably 21-23 nucleotides. The method preferably utilizes a 2' modified siRNA that is 
modified at the T position of at least one ribonucleotide of said siRNA. The method utilizes a 

15 siRNA that is modified with chemical groups selected from the group consisting of fluoro-, 

methyl-, methoxyethyl- and propyl-modification. The fluoro-modification is preferred and either 
a 2'-fluoro-modication or a 2',2 -fluoro-modification is useful in the present invention and 
preferred. 

The modification may be at the pyrimidines, the purines or a combination thereof of the 

JO siRNA are modified. More preferably the pyrimidines are modified, such as cytosine, a 
derivative of cytosine, uracil, a derivative of uracil or a combination thereof. In one ■• 
embodiment, at least one strand of the siRNA contains at least one modified nucleotide and in an 
alternate embodiment, oth strands of the siRNA contains at least one modified nucleotide. 

The method is intended to target disease causing agents or pathogens, an more particularly 

15 viruses, which can be either a RNA virus or a DNA virus, which are selected from the group 
consisting of hepatitis C virus (HCV), hepatitis A virus, hepatitis B virus, hepatitis D virus, 
hepatitis E virus, Ebola virus, influenza virus, rotavirus, reo virus, retrovirus, poliovirus, human 
papilloma virus (HPV), metapneumovirus and coronavirus. More preferably the target virus is a 
S ARS virus. Hie present method utilizes a siRNA prepared by (a) identifying a target nucleotide 

10 sequence in a virus genome, preferably S ARS virus, for designing a small interfering RNA 
(siRNA); and (b) producing a siRNA that has been modified to contain at least one modified 
nucleotide. More preferably, the siRNA comprises a dsRNA molecule with a first strand 
ribonucleotide sequence corresponding to a nucleotide sequence corresponding to a target 
nucleotide sequence in said virus and a second strand comprising a ribonucleotide sequence 

5 complementary to said target nucleotide sequence, wherein said first and second strands are 
separate complementary strands that hybridize to each other to form said dsRNA molecule, and 
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further wherein the first strand ribonucleotide sequence, the second strand ribonucleotide 
sequence or both the first and second strand ribonucletide sequences comprise at least one 
modified nucleotide. In this method, the target nucleotide sequence comprises a conserved 
nucleotide sequence necessary for SARS virus replication, and the conserved nucleotide 
sequence is selected from the group consisting of SEQ ID NO: 7292, SEQ ID NO: 7293, SEQ ID 
NO: 7294, SEQ ID NO: 7295, SEQ ID NO: 7296, SEQ ID NO: 7297, SEQ ID NO: 7298, SEQ 
ID NO: 7299, SEQ ID NO: 7300 and SEQ ID NO: 7301. Preferably, the nucleotide sequence is 
selected from the group consisting of SEQ ID NO: 7292 and SEQ ID NO: 7293. Still more 
preferably, the nucleotide sequence is SEQ ID NO: 7293. 

The siRNA disclosed in this application may be prepared with modified ribonucleotides as 
described herein. Further, the modified ribonucleotide of the siRNA used in the present method 
is incorporated into said siRNA by chemical synthesis or enzymatic synthesis. 

The siRNA disclosed in this application may or may not have a 5' triphosphate group. 
The modified siRNA is administered to a patient by a method selected from the group 
consisting of intravenous injection, subcutaneous injection, oral deUvery, and liposome delivery: 
The modified siRNA accumulates in an organ, tissue or body system of the patient that are the 
liver, gastrointestinal tract, respiratory tract, cervix or skin. 

The present invention also provides a method of inhibiting the replication of a virus, such 
as SARS virus, in cells positive for SARS virus comprising transfecting SARS-positive cells 
with a vector that directs the expression of modified siRNA which is specific for SARS. The 
cells are evaluated to determine if a marker in the cells has been inhibited by the modified 
siRNA. 

The term patient, as used herein, can be an animal, preferably a mammal. More preferably 
the subject can be a primate, including non-human and humans. The terms subject and patient 
can be used interchangeably. 

The treatment envisioned by the current invention can be used for subjects with a pre- 
existing viral infection, or for subjects pre-disposed to a SARS virus infection. Additionally, the 
method of the current invention can be used to correct or compensate for cellular or 
physiological abnonnalities involved in conferring susceptibility to viral infections in patients, 
and/or to alleviate symptoms of a viral infection in patients, or as a preventative measure in 



The method of treating a patient having a viral infection involves adininistration of 
compositions to the subjects. As used herein, composition can mean a pure compound, agent 
substance or a mixture of two or more compounds, agents or substances. As used herein, the 
term agent, substance or compound is intended to mean a protein, nucleic acid, carbohydrate, 
lipid, polymer or a small molecule, such as a drug. 
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In one embodiment of the current invention, the composition administered to the subject is 
a pharmaceutical composition. Further, the pharmaceutical composition can be administered 
orally, nasally, parenterally, intrasystemically, intraperitoneally, topically (as by drops or 
transdermal patch), bucally, or as an oral or nasal spray. Hie term "parenteral," as used herein, 
5 refers to modes of administration which include intravenous, intramuscular, intraperitoneal, 
intrasternal, subcutaneous and intraarticular injection and infusion. Hie pharmaceutical 
compositions as contemplated by the current invention may also include a pharmaceutical^ 
acceptable carrier. 

By "pharmaceutical^ acceptable carrier" is intended, but not limited to, a non-toxic solid, 
10 semisolid or liquid filler, diluent, encapsulating material or formulation auxiliary of any type, 
such as liposomes. 

A pharmaceutical composition of the present invention for parenteral injection can 
comprise pharmaceutical^ acceptable sterile aqueous or nonaqueous solutions, dispersions, 
suspensions or emulsions as well as sterile powders for reconstitution into sterile injectable 

15 solutions or dispersions just prior to use.. Examples of suitable aqueous and nonaqueous carriers, 
diluents, solvents or vehicles include water, ethanol, polyols (such as glycerol, propylene glycol; 
polyethylene glycol, and the like), carboxymethylcellulose and suitable mixtures thereof, 
vegetable oils (such as olive oil), and injectable organic esters such as ethyl oleate. Proper 
fluidity can be maintained, for example, by the use of coating materials such as lecithin, by the • 

10 maintenance of the required particle size in the case of dispersions, and by the use of surfactants. 
The compositions of the present invention can also contain adjuvants such as, but not 
limited to, preservatives, wetting agents, emulsifying agents, and dispersing agents. Prevention 
of the action of microorganisms can be ensured by the inclusion of various antibacterial and 
antifungal agents, for example, paraben, chlorobutanol, phenol, sorbic acid, and the like. It can 

15 also be desirable to include isotonic agents such as sugars, sodium chloride, and the like. 
Prolonged absorption of the injectable pharmaceutical form can be brought about by the 
inclusion of agents which delay absorption such as aluminum monostearate and gelatin. 

In some cases, to prolong the effect of the drugs, it is desirable to slow the absorption from 
subcutaneous or intramuscular injection. This can be accomplished by the use of a liquid 

(0 suspension of crystalline or amorphous material with poor water solubility. Hie rate of 

absorption of the drug then depends upon its rate of dissolution which, in turn, can depend upon 
crystal size and crystalline form. Alternatively, delayed absorption of a parenterally 
administered drug form is accomplished by dissolving or suspending the drug in an oil vehicle. 
Injectable depot forms are made by forming microencapsule matrices of the drug in 

>5 biodegradable polymers such as polylactide-polyglycolide. Depending upon the ratio of drug to 
polymer and the nature of the particular polymer employed, the rate of drug release can be 
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controlled. Examples of other biodegradable polymers include poly(ormoesters) and 
poly(anhydrides). Depot injectable formulations are also prepared by entrapping the drug in 
liposomes or microemulsions which are compatible with body tissues. 

The injectable formulations can be sterilized, for example, by filtration through a bacterial- 
retaining filter, or by incorporating sterilizing agents in the form of sterile solid compositions 
which can be dissolved or dispersed in sterile water or other sterile injectable medium just prior 
to use. 

Solid dosage forms for oral administration include, but are not limited to, capsules, tablets, 
pills, powders, and granules, m such solid dosage forms, the active compounds are mixed with 
at least one item pharmaceutical^ acceptable excipient or carrier such as sodium citrate or 
dicalcium phosphate and/or a) fillers or extenders such as starches, lactose, sucrose, glucose, 
mannitol, and silicic acid, b) binders such as, for example, carboxymethylcellulose, alginates, 
gelatin, polyvinylpyrrolidone, sucrose, and acacia, c) humectants such as glycerol, d) .. 
disintegrating agents such as agar-agar, calcium carbonate, potato or tapioca starch, alginic acid, 
certain silicates, and sodium carbonate, e) solution retarding agents such as paraffin, f) 
absorption accelerators such as quaternary ammonium compounds, g) wetting agents such as, for 
example, acetyl alcohol and glycerol monostearate, h) absorbents such as kaolin and bentonite 
clay, and i) lubricants such as talc, calcium stearate, magnesium stearate, solid polyethylene 
glycols, sodium lauryl sulfate, and mixtures thereof. In the case of capsules, tablets and pills, the. 
dosage form can also comprise buffering agents. 

Solid compositions of a similar type can also be employed as fillers in soft and hard filled . 
gelatin capsules using such excipients as lactose or milk sugar as well as high molecular weight 
polyethylene glycols and the like. 

The solid dosage forms of tablets, dragees, capsules, pills, and granules can be prepared 
with coatings and shells such as enteric coatings and other coatings well known in the 
pharmaceutical formulating art They can optionally contain opacifying agents and can also be 
of a composition that they release the active ingredients) only, or preferentially, in a certain part 
of the intestinal tract, optionally, in a delayed manner. Examples of embedding compositions 
which can be used include polymeric substances and waxes. 

The active compounds can also be in micro-encapsulated form, if appropriate, with one or 
more of the above-mentioned excipients. 

Liquid dosage forms for oral administration include, but are not limited to, 
phannaceuticaUy acceptable emulsions, solutions, suspensions, syrups and elixirs. In addition to 
the active compounds, the liquid dosage forms can contain inert diluents commonly used in the 
art such as, for example, water or other solvents, solubilizing agents and emulsifiers such as 
ethyl alcohol, isopropyl alcohol, ethyl carbonate, ethyl acetate, benzyl alcohol, benzyl benzoate, 
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propylene glycol, 1,3-butylene glycol, dimethyl formamide, oils (in particular, cottonseed, 
groundnut, corn, germ, olive, castor, and sesame oils), glycerol, tetrahydrofurfuryl alcohol, 
polyethylene glycols and fatty acid esters of soibitan, and mixtures thereof. 

Besides inert diluents, the oral compositions can also include adjuvants such as wetting 
5 agents, emulsifying and suspending agents, sweetening, flavoring, and perfuming agents. 

Suspensions, in addition to the active compounds, can contain suspending agents as, for 
example, ethoxylated isostearyl alcohols, polyoxyethylene sorbitol and sorbitan esters, 
microcrystalline cellulose, aluminum metahydroxide, bentonite, agar-agar, and tragacanth, and 
mixtures thereof. 

10 Alternatively, the composition can be pressurized and contain a compressed gas, such as 

nitrogen or a liquefied gas propellant The liquefied propellant medium and indeed the total 
composition is preferably such that the active ingredients do not dissolve therein to any 
substantial extent The pressurized composition can also contain a surface active agent Hie 
surface active agent can be a liquid or solid non-ionic surface active agent or can be a solid 

15 anionic surface active agent It is preferred to use the solid anionic surface active agent in the 
form of a sodium salt 

The compositions of the present invention can also be administered in the form of 
liposomes. As is known in the art, liposomes are generally derived from phospholipids or other 
lipid substances. liposomes are formed by mono- or multi-lamellar hydrated liquid crystals that 

» are dispersed in an aqueous medium. Any non-toxic, physiologically acceptable and 

metabolizable lipid capable of forming liposomes can be used. The present compositions in 
liposome form can contain, in addition to the compounds of the invention, stabilizers, 
preservatives, excipients, and the like. The preferred lipids are the phospholipids and the 
phosphatidyl cholines (lecithins), both natural and synthetic. Methods to form liposomes are 

5 known in the art {see, for example, Prescott, Ed, Metk Cell Biol. 14:33 et seq (1976)). 

One of ordinary skill will appreciate that effective amounts of the agents of the invention 
can be determined empirically and can be employed in pure form or, where such forms exist, in 
pharmaceutically acceptable salt, ester or prodrug form. The agents can be administered to a 
subject, in need of treatment of viral infection, as pharmaceutical compositions in combination 

0 with one or more pharmaceutically acceptable excipients. It will be understood that, when 

administered to a human patient, the total daily usage of the agents or composition of the present 
invention will be decided by the attending physician within the scope of sound medical 
judgement The specific therapeutically effective dose level for any particular patient will 
depend upon a variety of factors: the type and degree of the cellular or physiological response to 

5 be achieved; activity of the specific agent or composition employed; the specific agents or 

composition employed; the age, body weight, general health, sex and diet of the patient; the time 
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of administration, route of administration, and rate of excretion of the agent; the duration of the 
treatment; drugs used in combination or coincidental with the specific agent; and like factors 
well known in the medical arts. For example, it is well within the skill of the art to start doses of 
the agents at levels lower than those required to achieve the desired therapeutic effect and to 
gradually increase the dosages until the desired effect is achieved. 

Dosing can also be arranged in a patient specific manner to provide a predetermined 
concentration of the agents in the blood, as determined by techniques accepted and routine in the 
art Thus patient dosaging can be adjusted to achieve regular on-going blood levels, as measured 
by HPLC, on the order of from 50 to 1000 ng/ml. 

It will be readily apparent to one of ordinary skill in the relevant arts that other suitable 
modifications and adaptations to the methods and applications described herein can be made 
without departing from the scope of the invention or any embodiment thereof. 

The modified siRNA is prepared by custom chemical synthesis by Dharmacon, at 
Lafayette CO. Each C and U within the siRNA duplex (GL2), has been substituted with 2-F-U ■ 
and 2-F-C except for the 3'-end overhang, which was dTYfT. 

To test the stability of 2' chemically modified siRNA compared to unmodified siRNA 
(siRNA), the following experiment is performed 4ngs of siRNA are added to a 20 yL volume of 
80% human serum from a healthy donor. This mixture is incubated at 37°C for various times 
ranging from 1 minute up to 10 days. The same process is performed for 2 f fluorine modified 
siRNA (2-F siRNA). When the incubation process is finished, the mixtures are placed on ice and 
then immediately separated by PAGE along with a 32 P-siRNA control. The 2 1 modified siRNA ' 
is stable as compared to unmodified siRNA. 

V. IDENTIFICATION OF THERAPEUTICALLY ACTIVE AGENTS FOR TREATMENT 
OF SARS VIRUS INFECTION 

The invention provides methods for treating SARS by administering therapeutically active 
agents, such as small molecule compounds, to a mammal, as well as methods of identifying 
therapeutically active agents, such as potent small molecules, for the treatment of SARS virus 
infection. 

In one aspect of the invention a method of identifying a therapeutically active agent is 
provided comprising: (a) contacting the therapeutically active agent with a cell infected with the 
SARS virus; (b) measuring attenuation of a SARS related enzyme. 

In a more particular embodiment, the therapeutically active agent is a small molecule. In 
another more particular embodiment, the therapeutically active agent is a nucleoside analog (e.g. 
Ribavirin). In another more particular embodiment the small molecule is a SMIP or peptidic 
immunomodulating compound. In another more particular embodiment the therapeutically 
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active agent is a peptoid, oligopeptide, or polypeptide. In another embodiment the SARS related 
enzyme is SARS protease. In another embodiment the SARS related enzyme is SARS 
polymerase. In still another embodiment the SARS related enzyme is a kinase. In still another 
embodiment, the SARS related enzyme is a protease. Hie furin inhibitor peptidyl 
5 chloromethylketone prevents blocks cell-cell fusion after MHV infection (de Haan et al (2004) / 
Virol), which offers guidance for SARS therapy. 

The invention includes a cell-based assay that can be used to screen for and identify a 
therapeutically active agent for the treatment of SARS virus infection. Therapeutically active 
agents of the invention include agents that inhibit, prevent or reduce the replication of a SARS 

10 virus. Such agents can be identified by infecting a cultured cell (such as, for example, VERO 
cells) with a SARS virus and evaluating the impact of potential antiviral compounds on SARS 
virus replication. Assays to measure the effect of a potential antiviral compound on virus 
replication are known in the art and may be based on a variety of parameters. 

The cell-based assay may be used in a high-throughput screen to identify therapeutically 

15 active compounds from chemical libraries comprising potential antiviral compounds. 

Therapeutically active compounds suitable for use in the invention may inhibit any SARS viral 
target that is essential for viral replication in whole cells. Efficacy (the ability of a compound to 
inhibit or inactivate the target, be it viral or cellular, that results in the reduction of virus in the 
culture) of the therapeutic agent is measured by assessing the viability and/or the proliferation of 

i0 surviving cells in a SARS virus infected cell culture. 

A number of methods can be used to measure cell viability are known in the art, such as 
assays measuring cellular enzymes, proteins, nucleotide triphoshates (e.g. ATP), nucleic acids 
(e.g. host cell mRNA (e.g. GAPDH) or rRNA sequences) or cellular metabolites such as MTT or 
MTS. In addition, fluorescent (including, for example HSV paper) or non-fluoiescent dyes (e.g. 

15 propidium diiodide) or labeling of DNA can be used to measure indications of cell viability 
and/or proliferation. 

Alternatively, efficacy of a compound or sample can be determined by directly measuring 
the amount of virus or viral products in the culture. Methods for measuring the amount of virus, 
viral genome or viral products include: PCR, RT-PCR, TMA, reporter proteins with fluouiescent 

0 or luminescent qualities or enzymatic functions (e.g., lucif erase, alkaline phosphatase, GFP) or 
proteins that can be detected by antibodies (e.g. EGF) that might be incorporated into the viral 
genome prior to infection of the cell culture. Further, viral products such as viral proteins can be 
measured by BUS A or enzymatic activities. Methods for identifying viral polynucleotides, viral 
proteins and antibodies specific to viral proteins are discussed above. 

5 Potential antiviral compounds are applied to the cell-based assay at a concentration of 

approximately 10 fiM and compound classes having therapeutic effect are identified by 
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measuring the parameter of choice (such as cell viability/proliferation or the virus or viral 
genome or a viral product be it viral in origin or non-virus in orgin). Once compounds are 
identified as having activity, they are resynthesized, and analoged Starting with the identified 
compound, many analogs and new compounds are synthesized during consecutive optimization 
5 cycles of synthesis, biological profiling and modeling techniques to optimize the to the lead 
structure until in vivo activity is elucidated and optimized 

Cells suitable for use in the assay include the cells described above as suitable for vaccine 
production. Preferably, the cells are African green monkey kidney cells (Vero) cells. Human 
embyronic lung fibroblasts or normal human diploid fibroblasts may also be used in the 
10 invention. . 

. In one embodiment, the invention includes a fluorescence based cytopathogenicity assay to 
measure the effect of a potential antiviral compound on a cell-based assay. One example of a 
fluorescence based cytopathogenicity assay is illustrated below. 

1 x 10 4 Vero cells per well of a microtiter plate (MTP) are infected with a defined amount 

15 of SARS Virus selected within the following ranges for optimal MOI: 5-10, 10-25, 25-50, 50- 
100, 100-500, or 500-1000 PFU in a total volume of 200 /il media (M199 medium supplemented 
with 5% FCS, 2 mM glutamine, 100 IU/ml penicillin and 100 /ig/ml streptomycin) in the 
presence or absence of the potential antiviral compound and incubated for at least 1, 2, 3, 4, 5, 6, 
or 7 days at 37°C, 5% CO* The wells of the MTP are washed with PBS (200 /il) and then filled 

10 with 200 ill PBS containing 10 /zg/ml fluorescein diacetate. After a 45-min incubation at room 
temperature, fluorescence is measured at 485 nm excitation and 538 nm emission wavelengths. 
IQo values are determined by a nonlinear plot of antiviral activity as a function of drug 
concentration. 

Other cell based assays are known in the art and include, among others, methods of GFP 
15 detection and Luc detection. In addition, a Promega kit is commercially available that provides 
additional methods of measuring cell viability, etc. 

In one embodiment, the invention includes a method of measuring the efficacy of a 
potential antiviral compound using RT-PCR to detect the levels of SARS viral RNA in the cell 
based assay. Methods of using RT-PCR are known in the art One example of such an assay is 
0 described below. 

5 x 10 6 Vero cells are seeded in tissue culture. Flasks containing the cells are incubated 
over night at 37°C, 5% C0 2 . The cells are infected (m.o.i. = 1) with SARS virus in the presence 
and absence of potential antiviral compounds. Optionally, the cells may be pretreated with the 
potential compound prior to infection. In either case, a suitable control cell assay is also 
5 prepared. 



-213- 



WO 2004/092360 



PCT/US2004/011710 



Hie RNA of infected cells is purified at 2 h (UL54), 12 h (UL8) and 16 h (UL13) after 
infection, (Qiagen) RNA purification (RNeasy kit; 40 ill elution) and quantified (absorption at 
260 nm). Hie RNA (2 fig) is reverse transcribed with a specific primer (2 pmol, using one of the 
primer pairs described herein) into cDNA according to theSuperscript II protocol (Invitrogen). 

5 Aliquots (2 jil) of the reverse transcription reaction are amplified by PCR. Fragments of the 
appropriate target SARS gene, Le. 9 a gene encoding a SARS enzyme, are amplified in 30 cycles 
(UL54 and UL8: 3 min, 94°C hot start, 1 min, 94°C denaturation; 1 min, 55°C annealing; 1 min, 
72 °C polymerization. UL13: 3 min, 94 °C hot start; 1 min, 94 °C denaturation; 1 min, 60 °C 
annealing; 1 min, 72°C polymerization) by PCR (Taq-Polymerase, Stratagene), in a 100-/il 

.0 reaction volume with the appropriate oligonucleotides, as described herein at 0.1 nmol each. 8-/il 
aliquots of cycle 20-30 (lanes 2-12) of the PCR were resolved on a 2% agarose gel (Invitrogen) 
according to the manufacturer's instructions. 

Cell-based assays of the invention may optionally use a variant or derivative of a wild-type 
SARS virus that has reduced or attenuated virulence in humans and/or animal models (e.g., 

5 mouse, non-human primate, etc.) Use of such attenuated SARS viruses in screening methods ' 
may reduce safety concerns and precautions that would otherwise be associated with the 
pathogenic nature of the SARS virus and may eliminate or reduce the need for the 
implementation of cumbersome high containment levels during performance of the assays and 
screening of compounds. 

0 The invention includes an enzyme-based assay that can be used to screen for and identify a 

therapeutically active agent for the treatment of SARS virus infection. 

An embodiment of the invention is an assay comprising contacting a known quantity of 
SARS protease in solution to a peptide containing a detectable marker and cleavage site for 
SARS protease, wherein SARS protease activity is monitored by measuring the intensity of the 
S marker on the cleaved product. 

In a more particular embodiment, a method of assaying for SARS protease is provided 
comprising contacting a sample solution containing SARS protease with a peptide containing a 
fluorescent donor, fluorescent quencher, and cleavage site for SARS protease, said peptide being . 
detectable with a fluorometer when cleaved, wherein SARS protease activity is determined in the 
) sample by the amount of fluorescence detected by the fluorometer. 

Assays based on the direct measurement of SARS protease inhibition may be utilized for 
screening for SARS therapeutics. Protease for such assays such as 3C-like protease and papain- 
like protease may be isolated and purified for such assays as described in Seybert, etaLJ. Gen. 
Virol., 78:71-75, 1997, Ziebuhr, et aL, Adv. Exp. Med Biol., 440:115-120, 1998, Sims, et al., 
> Adv. Exp. Med. Biol. 440:129-134, 1998, Ziebuhr, etaLJ. Virol., 73: 177-185, 1999, Teng, et 
at, J. Virol., 73:2658-2666, 1999, IJerold, et-al, J. Biol. Chem. 274:14918-14925, 1999, and 

-214- 



WO 2004/092360 



PCT/US2004/011710 



Ziebuhr, etcd.,1. Biol. Chem- 276:33220-33232, 2001. Furthermore, Example 30 describes a 
novel method of purifying SARS protease using column chromatography. Example 31 describes 
a continuous fluorescence resonance energy transfer (FRET) assay for measuring SARS protease 
activity. Protease enzyme based assays such as the FRET assay demonstrated in Example 31 are 

5 readily adapted for high- throughput screening and are used for screening candidate antiviral 
compounds. Performance of the protease enzymatic assay in the presence of a SARS protease 
inhibitor compound will show a decreased amount of fluorescence at a given time when 
compared to negative control assay containing no test compound on a non-inhibiting control 
compound. Such a method would involve the steps of: (a) providing an assay solution 

10 comprising SARS protease; (b) adding a test compound to the assay solution; (c) adding a 

substrate for SARS protease to the assay solution; and (d) measuring the proteolytic activity in 
the assay solution. In a preferred embodiment, the proteolytic activity is measured by the 
fluorescence of fluorophore product produced by the enzymatic activity of SARS protease. 
Attenuated SARS virus variants generally contain one or more genome modifications or 

15 mutations (e.g. , substitutions, deletions, insertions) in protein encoding or non-coding regions. 
Specific examples of attenuating mutations include, for example, genetic modifications in the 5'- 
end noncoding region, leader sequence, intergenic regions, 3'-end noncoding region, ORF la, 
ORF lb, S gene, E gene, M gene, N gene, or any of the nonstructural protein genes outside of the 
ORF la/lb region. Preferred attenuating mutations are in a SARS virus structural protein (e.g. y 

10 Spike (S)), a protease or polymerase domain, or a non-coding sequence (e.g., 5'-end noncoding 
region, intergenic sequence). In addition, a cleavage site may be introduced or eliminated within 
the spike protein (see for example, Gombold etaL,]. Virol. 67:4504-4512, 1993; Bos et al 7 
Virology 214:453-463, 1995), such modification that may also be useful for optimization of 
expression of recombinant spike protein antigen (e.g. , for vaccine purposes). 

t5 A variety of methods are used according to the present invention in order to obtain 

attenuated variants of SARS virus. Such methods include serial passage of the SARS virus in 
cultured cells (e.g., mammalian cell culture, such as fetal rhesus kidney cells or VERO cells), 
until the SARS virus demonstrates attenuated function. The serial propagation of virus may be 
performed at any temperature at which tissue culture passage attenuation occurs, and may be 

0 performed in conjunction with one or more steps of mutagenesis (e.g.* chemical mutagenesis). 
The attenuated phenotype of SARS virus variants, obtained after one or more cell culture 
passages, is readily measured by one skilled in the art As used herein, attenuation refers to the 
decreased virulence of the SARS virus in a human subject Evidence of attenuated function may 
be indicated by decreased levels of viral replication or by decreased virulence in an animal 

5 model. 
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Other methods of producing an attenuated SARS virus include cell culture passage of the 
virus at sub-optimal temperatures (cold passage), as well as introduction of attenuating mutations 
into the SARS viral genome by random mutagenesis (e.g., chemical mutagenesis, such as using 
5-fluorouracil) or using directed mutagenesis. Preparation and generation of attenuated RS V 
5 vaccines (the methods of which will generally applicable to SARS virus) are disclosed in, for 
example, EP 0640128, US Patent No. 6,284,254, US Patent No. 5,922326, US Patent No. 
5,882,651. 

The number of passages required to obtain safe, immunizing attenuated virus is dependent 
at least in part on the conditions employed Periodic testing of the SARS virus culture for 

10 virulence and immunizing ability in animals {e.g. , mouse, primate) can readily determine the 
parameters for a particular combination of tissue culture and temperature. 

In another embodiment, the cell-based assay for screening of antiviral compounds is based 
on the readout of expression of a gene product (e.g. , reporter gene product) that is not fiom 
SARS vims. Gene products particularly suitable to the present invention include, but are not 

15 limited to those of the above-described assays. 

In order to achieve such a read-out, the gene-of-interest (GOI) encoding said gene reporter 
gene product must be incorporated into a replicating SARS virus genome or construct derived 
from a SARS virus genome (e.g., SARS virus replicon, SARS virus defective-interfering (DI) 
RNA). Figure 13 is a schematic depicting locations for incorporation of the reporter gene into a 

10 SARS virus genome. Preferably, insertion of a heterologous reporter gene-of-interest is at a site 
between existing SARS virus genes, such as for example, as shown in Figure 13. For example^ 
the GOI may be inserted closely following the termination codon of a SARS virus gene (e.g., 
ORF lb, S, E, M, N). Insertion should be positioned in order to minimize disruption of mRNA 
transcription for the SARS virus gene(s). The GOI may also be inserted as an in-frame "fusion" 

-5 with an existing SARS virus gene, such that sufficient function of the GOI is maintained for 
detection. To optimize expression, an additional SARS virus intergenic sequence (e.g., SEQ ID 
NO: 7388, with or without additional flanking SARS virus sequences) may also be engineered 
into a position preceding the inserted GOI. 

Incorporation of a GOI into SARS virus may be accomplished by one of skill in the art 

\0 using a variety of techniques. For example, one preferred method is targeted RNA 
recombination, that takes advantage of the ability of coronavirus RNAs to undergo 
recombination within the cell (see for example Fischer et aL, J. Virol. 71:5148-5160, 1997; 
Koljesare/ol, J. Vet Sci. 2:149-157, 2001). A construct of desired configuration (e.g., cDNA 
of defective interfering RNA of SARS virus) containing the GOI flanked by SARS virus 

»5 sequence (e.g. , intergenic sequence) is generated such that RNA may be transcribed directly 
within a eukaryotic cell or in vitro and transfected into susceptible cells also infected with SARS 
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viius. Recombinant virus containing the GOI is identified based on expression of the GOI 
encoded marker. 

Alternatively, incorporation of a GOI into SARS virus may be accomplished by one of 
skill in the art by first assembling a full-length cDNA clone of the SARS virus, that can be used 

5 to produce infectious RNA transcripts in vivo (e.g., from an RNA polymerase II promoter) or in 
vitro (e.g. y from a bacteriophage promoter). Although relatively long in genome length, such 
assembly of a full-length cDNA clone is now readily obtainable by one of skill in the art using 
standard molecular biology and reverse genetics techniques and the genome sequence of SARS 
virus (see for example, Thiel et aL t J. Gen. ViroL, 82:1273-1281, 2001; Almazan et al. 9 Proc. 

10 Natl. Acad. Sci. USA 97:5516-5521, 2000; Thiel et al (2003) / Gen Virol 82:1273-1281; Yount 
et al (2003) PNAS USA 100: 12995-13000). Insertion of a heterologous GOI into a full-length 
SARS virus genome cDNA may be performed using a variety of techniques, such as for 
example, ligation into natural or synthetic restriction sites, PCR (e.g., overlapping PCR), and 
recombination. 

15 It may also be desirable to utilize similar SARS virus recombinants containing a gene-of- 

interest for antiviral screening, however, with further modification to minimize or eliminate 
virus-induced cytopathology (e.g., CPE). Non-cytopathic derivatives from SARS virus may be 
obtained by one of skill in the art using a variety of methods. For example, a selectable marker 
(e.g.-, drug resistance marker) may be incorporated as GOI into a SARS virus genome to produce 
20 infectious virus as described above (see for example, Perri et al. , J. Virol., 74:9802-9807, 2000). 
Infectious GOI-containing SARS virus or infectious genome RNA/cDNA is then used to 
infect/transfect cells (e.g. , VERO), with or without prior mutagenesis, after which time the 
infected cells are subjected to the appropriate selection. Only those cells containing SARS virus 
harboring both die selectable marker and one or more mutations rendering the virus non- 
25 cytopathic will survive the selection process and grow out Active SARS virus replication in 
these cells is readily detected using a variety of detection techniques (e.g. 9 PCR, Northern blot) 
and such cells may serve as the substrate for cell-based {screening assays. Mutations that result in 
the desired noncytopathic SARS virus phenotype may include nucleotide substitutions, deletions 
or additions, and may occur in a variety of genome coding or non-coding regions (e.g., 5' or 3 - 
50 end noncoding regions, intergenic regions, ORFla, ORFlb, a protease domain, a polymerase 
domain). The identification of such mutations is readily accomplished by exchange of sequences 
with wild-type (e.g., parental) SARS virus and demonstrating transfer of the phenotype, and 
sequencing of the appropriate genome region. Similar mutations that reduce or eliminate 
cytopathogenicity also may be utilized in the context of a SARS virus derived replicon vector, 
35 either by similar selection directly using a SARS virus replicon or by specific engineering of the 
replicon based on mutation(s) identified in the context of infectious SARS virus as described 
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above. In addition, such mutations may serve as the basis for attenuated SARS virus derivatives, 
as described elsewhere in this document 

Alternatively, rather than using infectious SARS virus or its derivatives for cell-based 
screening assays, propagation defective "replicons" may be engineered and utilized. Such 

5 replicons maintain all protein encoding sequences and cis replication sequences required for 
RNA replication and expression within a cell, but are deleted of one or more sequences or genes 
required for packaging of progeny SARS virus (see for example Curtis et oi, J. Virol., 76:1422- 
1434, 2002). Figure 14 is a schematic depicting representative examples of SARS virus 
replicons according to the present invention. For example a SARS virus cDNA construct is 

10 generated, that is lacking one or more (or all) structural protein encoding genes, whereby the 
missing SARS virus gene(s) is/are replaced by the GOI, maintaining all necessary transcription 
signals for expression of the GOL Operably linked with the SARS virus replicon cDNA 
construct is a promoter for RNA polymerase that can be used to transcribe the replicon RNA in 
vivo (e.g., RNA polymerase II promoter) or in vitro {e.g., bacteriophage promoter). The SARS 

.5 replicon may be introduced into a susceptible cell by transfection as RNA or DNA, depending on < 
the promoter of choice, and the transfected cells may be utilized for the evaluation of antiviral 
compounds. By incorporating one or more mutations rendering the replicon noncytopathic for 
the cells (see above), one can avoid the need for nucleic acid transfection each time an assay is to 
be performed 

O Alternatively, SARS virus replicons may be packaged into virus like particles that allow 

infection of cells, rather than requiring transfection of nucleic acid molecules. A requirement for 
replicon packaging is that essential SARS virus gene functions deleted from the replicon (e.g., 
one or more structural proteins) are provided in trans within the cell containing the replicon. A 
variety of methods for packaging of replicon RNA can be utilized to one of skill in the art (see 

5 for example, Curtis et aL, ibid: Ortego, et a/., J. Virol., 76:11518-11529, 2002). For example, 
stably transformed cell lines constitutively or inducibly expressing the required SARS virus gene 
functions may be utilized Alternatively, the required SARS virus gene functions may be 
expressed by viral vectors that are introduced into the replicon-containing cell. Alternatively a 
defective interfering (DI) SARS virus derived RNA containing the required gene functions may 

3 be introduced into the replicon-containing cell. Such DI constructs used to complement missing 
replicon functions may be more commonly referred to as defective helper RNA or defective 
helpers. 

Another configuration useful for cell-based antiviral screening assays according to the 
present invention utilizes SARS virus derived DI RNAs encoding a GOI (see for example 
) Stirrups, et a/., J. Gen. Virol., 81:1687-1698, 2000; Liao, et a/., Virology 208:319-327, 1995). 
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Introduction of a SARS DI, either as cDNA linked to an RNA polymerase II promoter or as in 
vitro transcribed RNA, into susceptible cells also infected with SARS virus, allows for a readout 
of the GOI reporter product in assays. 

A replicon-based system for rapid identification of coronavirus replicase inhibitors is 

5 described by Hertzig et aL (2004) / Gen Virol DOI 10.1099/vir/0/80044-0. Briefly, the system 
uses a non-cytopathic selectable replicon RNA that can be stably maintained in eukaryotic cells. 
Hie replicon RNA mediates reporter gene expression as a marker for coronavirus replication, 
and expression of the reporter can be used to test the inhibitory effect of test compounds in vitro* 
thereby allowing high throughput screening for replicase inhibitors without the need to grow 

10 infectious virus. Preferred replicon RNAs include a neomycin resistance gene in the replicase 
gene with a downstream reporter gene (e.g. GEP) that is expressed via replicase-mediated 
synthesis of a sub-genomic mRNA. 

VI. COMPOSITIONS AND METHODS FOR TREATMENT OF SARS VIRUS INFECTION 
Hie present invention relates to compositions and methods for the treatment and/or prevention of 

15 SARS. The invention further includes a method for the treatments : 
through the administration of a therapeutically effective amount of at least one antiviral, 
compound from among those described in the US Patents and published international patent 
applications listed in Table 1 and Table 2. In one embodiment of the method, the antiviral 
compound is a small molecule. In another embodiment, the antiviral compound is a protease 

10 inhibitor. In a further embodiment, the antiviral protease inhibitor is a 3C-like protease inhibitor 
and/or a papain-like protease inhibitor. Combined treatment with the lopinavir/ritonavir 
(Kaletra) protease inhibitor and ribavirin has shown a favorable clinical response (Chu et al. 
(2004) Thorax 59:252-256). In another embodiment, the antiviral compound is an inhibitor of an 
RNA dependent RNA polymerase. In another embodiment, a first antiviral compound that is a 

55 protease inhibitor is administered with a second antiviral compound that is an RNA-dependent 
RNA polymerase inhibitor. The invention further provides for the administration of a steroidal 
anti-inflammatory drug in combination with at least one antiviral compound, for example, from 
the antiviral compounds described in the documents listed in Table 1 and Table 2. A 
combination treatment of steroids and ribavirin has been described by Fujii et al. (2004) / Infect 

i0 Chemother 10:1-7. A combination treatment of corticosteroids and interferon alfacon-1 has also 
been reported (Loutfy et aL (2003) JAMA 290:3222-3228). 

The invention further provides for a method for the treatment and/or prevention of SARS 
through the administration of a therapeutically effective amount of at least one antiviral 
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compound from among those described in the US Patents and published international patent 
applications listed in Table 1 and Table 2 by inhalation. In another aspect, the antiviral 
compound may be administered in combination with a SMBP, SMIS, or other 
immunomodulatory compound such as those in Table 34 and in Table 35. hi one embodiment of 

5 the method, the antiviral compound is a small molecule. In another embodiment, the antiviral 
compound is a protease inhibitor, hi a further embodiment, the antiviral protease inhibitor is a 
3C-like protease inhibitor and/or a papain-like protease inhibitor. In another embodiment, the 
antiviral compound is an inhibitor of an RNA dependent RNA polymerase. In another 
embodiment, a first antiviral compound that is a protease inhibitor is administered with a second 

10 antiviral compound that is an RNA-dependent RNA polymerase inhibitor. The invention further 
provides for the administration of a steroidal anti-inflammatory drug in combination with at least 
one antiviral compound, for example, from the antiviral compounds described in the documents 
listed in Table 1 and Table 2. The steroidal anti-inflammatory drug may be administered by 
inhalation for a local effect or administered for systemic absorption such as via an oral or 

15 intravenous route. 

The invention further provides for methods for treating SARS infection comprising 
administering a small molecule immunopotentiator (SMIP) compound either alone or in 
combination with an antiviral compound or in combination with a SARS vaccine. In a further 
embodiment, the SMIP is a compound disclosed herein or set forth in Table 34. 

JO The invention further provides for methods for treating SARS infection comprising 

administering an immunosuppressant compound, optionally a small molecule suppressant 
(SMIS) compound either alone or in combination with an antiviral compound. In a further 
embodiment, the immunosuppressant compound is disclosed herein or set forth in Table 35. 
The invention further provides peptidic immunomodulating compositions, that include 

15 oligo and polypeptides, capable of effecting inflammatory response in a patient In one 

embodiment, the peptidic immunomodulating composition is able to stimulate human cells to 
produce cytokines. In another embodiment the peptidic immunomodulating composition is 
capable of decreasing cytokine levels in the human. Preferred Examples of peptidic 
immunomodulating compositions include those listed in Table 35, as well as TGF£2, TGFpl, 

0 TGFp3, thymopentin (TPS), P-mercaptopropionyl-argmyl— lysyl-aspar^ 

amide, colostrinine, lactoferrin (LF), cyclolinopeptide A (CLA), and tuftsin (TKPR). The 
peptidic immunomodulating compositions of the invention may be used alone or in combination 
with other agents, preferably antiviral compounds, for the treatment of SARS. 
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Hie invention further provides for a kit for use by a consumer for the treatment and/or 
prevention of SARS. Such a kit comprises: a) a pharmaceutical composition comprising a 
therapeutically effective amount of at least one antiviral, SMEP, SMIS, or other 
immunomodulating compound from among those described in the US Patents and published 
5 international patent applications listed in Table 1, Table 2, Table 34 and Table 35 and a 
pharmaceutical^ acceptable earner, vehicle or diluent; b) a container for holding the 
pharmaceutical composition; and, optionally, c) instructions describing a method of using the 
pharmaceutical compositions for the treatment and or the prevention of SARS. Hie kit may 
optionally contain a plurality of compounds for the treatment of SARS wherein the antiviral 

10 compounds are selected from 3C-like protease inhibitors and papain-like protease inhibitors. In 
a further embodiment, the kit contains an antiviral compound that is an RNA-dependent RNA 
polymerase inhibitor. When the kit comprises more than one antiviral, SMEP, SMIS, or other 
immunomodulating compound, the compounds contained in the kit may be optionally combined 
in the same pharmaceutical composition. 

15 An additional aspect of the invention provides for the use of at least one of the antiviral, 

SMIP, SMIS, or other immunomodulating compounds described in the US Patents and published 
international patent applications listed in Table 1, Table 2, Table 34 and Table 35 for the 
manufacture of a medicament for the treatment or prevention of SARS. 

An additional aspect of the invention provides for the use of at least one SMDP compound, 

20 or at least one immunosuppressant compound, or at least one SMIS compound for the 
manufacture of a medicament for the treatment or prevention of SARS. Preferred SMDP, 

immunosuppressant, and SMIS compounds are described herein. 

... 1 J 

Unless otherwise specified, the following terms, when used within Section VI: 
"Compositions and Methods for Treatment of SARS Virus Infection" of the present application 
15 have the meanings as defined below: 

As used herein, "limit", "treat" and 'treatment" are interchangeable terms as are "limiting" 
and "treating" and, as used herein, include preventative (e.g., prophylactic) and palliative 
treatment or the act of providing preventative or palliative treatment The terms include a 
postponement of development of SARS symptoms and/or a reduction in the severity of such 
symptoms that will or are expected to develop following infection with a SARS virus. The terms 
further include ameliorating existing SARS symptoms, preventing additional symptoms, 
ameliorating or preventing the underlying metabolic causes of symptoms. 

Representative uses of the compositions and methods of the present invention include: the 
elimination or reduction of the viral load of the SARS virus in a vertebrate, including humans, 
5 the elimination or reduction of symptoms associated with SARS, and a reduction in morbidity 



-221- 



WO 2004/092360 



PCT/US2004/011710 



associated with SARS. In a SARS patient population, the use of the compositions and methods 
of the invention will result in the reduction in the high mortality rates associated with SARS. 

Infection with the SARS virus and the symptoms associated with SARS can be treated in a 
subject by administering the compositions of the invention. The compositions of the invention 

5 may be administered systemically. For systemic use, the compounds herein are formulated for 
parenteral (e.g., intravenous, subcutaneous, intramuscular, intraperitoneal, intranasal or 
transdermal) or enteral (e.g., oral or rectal) delivery according to conventional methods. 
Intravenous administration can be by a series of injections or by continuous infusion over an 
extended period Administration by injection or other routes of discretely spaced administration 

10 can be performed at intervals ranging from weekly to once to three times daily or more. 
Alternatively, the compositions disclosed herein may be administered in a cyclical manner 
(administration of disclosed composition, followed by no administration, followed by 
administration of disclosed compositions, and the like). Treatment will continue until the desired 
outcome is achieved 

15 A "subject" is a vertebrate animal including a human that is in need of treatment with the 

compositions, methods and kits of the present invention. The term "subject* * or "subjects" is 
intended to refer to both the male and female gender unless one gender is specifically indicated. 

"Coadministration" of a combination of a plurality of antiviral compounds means that these 
components can be administered together as a composition or as part of the same, unitary dosage 

iO form. "Co-administration" also includes administering a plurality of antiviral compounds 

separately but as part of the same therapeutic treatment program or regimen. "Co-administration" 
also includes administering a plurality of other agents, such as, for example an oligopeptide, a 
polypeptide, a peptidic immunomodulator, nucleic acid, antibodies, or a vaccine wherein the 
compounds or agents are administered separately but as part of the same therapeutic treatment • 

25 program or regimen. The components need not necessarily be administered at essentially the 
same time, although they can if so desired "Co-administration" also includes separate 
administration at different times and in any order. For example, where appropriate a patient may 
take one or more components) of the treatment in the morning and the one or more of the other 
components) at night 

30 By "antiviral compound" as used herein is meant an antiviral compound as described in the 

US Patents and published international patent applications listed in Table 1 and Table 2. The US 
Patents and published international patent applications listed in Table 1, Table 2 and Table 35 
are incorporated herein in their entirety. In one embodiment, the antiviral compound is an RNA- 
dependent RNA polymerase. In another preferred embodiment the antiviral compound is a 3C- 

35 like protease inhibitor or a papain-like protease inhibitor. Hie antiviral compounds may be 
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administered in the form of the acid, or of a soluble alkali metal salt or alkaline earth metal salt 
where appropriate. 

The precise dosage of the antiviral compound will vary with the dosing schedule, the oral 
potency of the particular antiviral compound chosen, the age, size, sex and condition of the 
5 subject, the severity of the disorder to be treated, and other relevant medical and physical factors. 
Thus, a precise phannaceutically effective amount cannot be specified in advance and can be 
readily determined by the caregiver or clinician. 

Generally, an appropriate amount of antiviral compound is chosen to obtain a reduction in 
the load of the SARS virus in the subject and/or to obtain a reduction in the symptoms associated 
10 with SARS. For humans, an effective oral dose of antiviral compound is typically from about 
1.5 to about 6000 pgfkg body weight per day and preferably about 10 to about 2000 /ig/kg of 
body weight per day . 

One of ordinary skill in the art will recognize that certain antiviral, SMIP, SMIS, and 
immunomodulating compounds of the invention including 3C-like protease inhibitors, papain- 
15 like protease inhibitators, and RNA-dependent RNA polymerase inhibitors will contain one or 
more atoms that may be in a particular stereochemical, tautomeric, or geometric configuration, . 
giving rise to stereoisomers, tautomers and configurational isomers. All such isomers and 
mixtures thereof are included in this invention, when active. Crystalline and amorphous forms of 
the antiviral compounds of this invention are also included as are hydrates, solvates, polymorphs, 
20 and isomorphs of the antiviral compounds of the invention. 

SMTP compounds of the invention include compounds are described in issued U.S. Patent 
Nos. 4,547,51 1 and 4,738,971 with the general structure (a): 

R" O 

H 

a 

15 for the treatment of disorders responsive to agents that enhance cell-mediated immunity. 

Immunostimulatory oligonucleotides and polynucleotides are described in PCT WO 
98/55495 and PCT WO 98/16247. U.S. Patent Application No. 2002/0164341 describes 
adjuvants including an unmethylated CpG dinucleotide (CpG ODN) and a non-nucleic acid 
adjuvant. U.S. Patent Application No. 2002/0197269 describes compositions comprising an 

JO antigen, an antigenic CpG-ODN and a polycationic polymer. 
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Additionally, issued U.S. Patent Nos. 4,689338, 5,389,640, 5,268,376, 4,929,624, 
5,266,575, 5352,784, 5,494,916, 5,482^36, 5346,905, 5395,937, 5,238,944, 5325,612, 
W099/29693 and U.S. Ser. No. 09/361,544 disclose compounds of the general structure (b): 

R R' 




NH 2 

5 b 

for the use as "immune response modifiers." 

Further compounds with SMIP and antiviral activity are described below and in US 

Patent Application entitled Thiosemicarbazones as Anti-Virals and Lnmunopotentiators filed on 

December 29, 2003 with an attorney docket number of PP19814.004US generally disclosing 
10 compounds of the following structures: 

A compound of formula c: 



Y S 
Y 



1 ...A. 



1 R 1 R" Z 



wherein: E is absent or selected from the group consisting of alkyl, substituted alkyl, 
cycloalkyl, substituted cycloalkyl, aryl, substituted aryl, heterocyclyl, substituted heterocyclyl, 
heteroaryl, and substituted heteroaryl; 

L is absent or is selected from the group consisting of oxo, amino, alkylene, substituted 
alkylene, alkoxy, alkylamino, aminoalkyl, heterocyclyl, carbocyclyl, and caibonyl; 

W is absent or selected from the group consisting of cycloalkyl, substituted cycloalkyl, 
aryl, substituted aryl, heterocyclyl, substituted heterocyclyl, heteroaryl, and substituted 
heteroaryl; 

X is absent or is selected from the group consisting of oxo, amino, alkylene, substituted 
alkylene, alkoxy, alkylamino, aminoalkyl, heterocyclyl, carbocyclyl, and carbonyl; 

Y is selected from the group consisting of cycloalkyl, substituted cycloalkyl, aryl, 
substituted aryl, heterocyclyl, substituted heterocyclyl, heteroaryl, and substituted heteroaryl; 



-224- 



WO 2004/092360 



PCTAJS2004/011710 



Y is absent or is selected from the group consisting of F, CI, Br, I, nitro, alkyl, 
substituted alkyl, and optionally substituted heterocyclyi, amino, alkylamino, dialkylamino; 
Y" is absent or is selected from the group consisting of F, CI, Br, I, nitro, alkyl, substituted alkyl, 
and optionally substituted heterocyclyi, amino, alkylamino, dialkylamino; 
5 R' is H, alkyl, or substituted alkyl; 
R"isH,or 

R' and R' * are taken together to form a hetercyclic ring; 

Z and Z' are independently selected from the group consisting of hydrogen, alkyl, substituted 
alkyl, aryl, substituted aryl, aryl alkyl, substituted arylalkyl, heteroaryl, substituted heteroaryl, 

10 heteroarylalkyl, substituted heteroarylalkyl, alkoxy, substituted alkoxy, aminocarbonyl, 
alkoxycarbonyl, carboxyl sulfonyl, methanesulfonyl, and substituted or unsubstituted 
alkylcarbonyl, arylcaibonyl, aralkylcaibonyl, heteroarylcarbonyl, heteroaralkylcaibonyl, 
alkylcaibonyloxy, arylcarbonyloxy, aralkylcarbonyloxy, heteroarylcaibonyloxy, 
heteroaralkylcarbonyloxy, alkylaminocarbonyloxy, arylaminocarbonyloxy, formyl, 

IS loweralkylcaibonyl, loweralkoxycarbonyl, aminocarbonyl, aminoaryl, alkylsulfonyl, 
sulfonamido, aminoalkoxy, alkylamino, heteroarylamino, alkylcarbonylamino, 
alkylaininocarbonylamino, arylaminocaibonylamino, aralkylcaibonylamino, 
heteroarylcarbonylamino, arylcarbonylamino, cycloamidino, cycloalkyl, cycloimido, 
arylsulfonyl and arylsulfonamido; or 

20 Z and Z' are taken together to form a heterocyclic group, that may be optionally substituted and 
the tautomers and the pharmaceutical^ acceptable salts* esters, or prodrugs thereof. 

. . Further SMTP compounds are described below and in US Patent Application 107762873, 
Use of Tryptanthrin Compounds for Immune Potentiation, filed on January 21, 2004 and 
disclosing the general embodiment of compounds represented by Formula (d): 



25 




wherein 

A, B, C, D, E, F, G, and H are independently selected from carbon and nitrogen, or A and 

B and/or C and D can be taken together to be nitrogen or sulfur, 
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R b R2, R3, Rj, R8» and Rio are independently selected from the group consisting of hydrogen, 
halogen, loweralkyl, alkyl, substituted alkyl, cycloalkyl, heterocyclyl, alkylheterocyclyl, 
substituted heterocyclyl, substituted alkenyl, amino, (substituted alkyl)(alkyl)amino, imino, 
haloloweralkyl, hydroxy, alkoxy, substituted alkoxy, hydroxyalkylthio, nitro, alkylsulfonyl, N- 
5 alkylsulfonamide, arylalkyl, arylalkylaryl, arylaryl, aryloxy, arylamino, acylamino, 

acyloxyamino, alkylaminoacylamino, alkylaminosulfonylamino, alkylamino, alkenylamino, 
dialkylamino, alkoxyalkylamino, alkoxyalkylheterocyclyl, mercaptoalkoxyalkyl, cyano, 
formyl, -COOR n wherein R u is hydrogen, loweralkyl, aryl, heterocyclyl, monosaccharide or 
disaccharide, and -CONR 12 R 13 wherein R 12 and R 13 are independently selected from hydrogen, 
10 loweralkyl, aryl, heterocyclyl, saccharide, peptide and amino acid residues; or R 2 and R 3 taken 
together form a six membered aromatic ring; 

R 7 and Rg are independently selected from hydrogen, halogen, loweralkyl, 
haloloweralkyl, cycloalkyl, heterocyclyl, substituted heterocyclyl or heterocyclylalkyl; and 

Rj, R 2 , R3, R4, R7, Rs. R9,and R 10 are absent when the ring atom to which they would 
5 otherwise be bonded is sulfur or double-bonded nitrogen; or 

the a phannaceutically acceptable salts, esters, or prodrugs thereof, 
provided that R lf R 2 , R 3 , R4, R 7 > Rs> R9, and Ri 0 are not all hydrogen when A, B, C, E, F, and 
H are carbon. 

hi one embodiment, the compounds of Formula (J) have a backbone structure wherein D is 
0 nitrogen, and A-C and E-H are carbon. 

In one embodiment, when D is carbon, at least one, or at least two of R x - R4 and R 7 - R 10 are 
not hydrogen. 

In one embodiment, Rj through and R 8 and R 10 are independently selected from at least two 
of the group consisting of hydrogen, halogen, loweralkyl, cycloalkyl, heterocyclyl, substituted 

5 heterocyclyl, alkylheterocyclyl, amino, imino, haloloweralkyl, alkoxy, nitro, alkylsulfonyl, 
arylalkyl, arylalkylaryl, arylaryl, aryloxy, arylamino, acylamino, acyloxyamino, 
alkylaminoacylamino, alkylaminosulfonylamino, alkylamino, alkenylamino, dialkylamino, 
alkoxyalkylamino, alkoxyalkylheterocyclyl, mercaptoalkoxyalkyl, cyano, formyl, -COOR n 
where R n is hydrogen, loweralkyl, aryl, heterocyclyl, monosaccharide or disaccharide, 

) and -CONR 12 R 13 where R 12 and R 13 are independently selected from hydrogen, loweralkyl, 
aryl, heterocyclyl, saccharide, peptide and amino acid residues; and R4 is not present when D is 
nitrogen. 
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In an additional embodiment, 4 A, B, C, D, E, F, G, and H are independently selected from 
carbon and nitrogen; 

Rj, R2, R 3 , R4 t Rg and Rjq are independently selected from the group consisting of 
hydrogen, halogen, loweralkyl, alkyl, substituted alkyl, heterocyclyl, substituted heterocyclyl, 
5 substituted alkenyl, (substituted alkyl)(alkyl)amino, haloloweralkyl, hydroxy, alkoxy, substituted 
alkoxy, hydroxyalkylthio, nitro, N-alkylsulfonamide, cyano, -COOR^ wherein R^ is hydrogen, 
loweralkyl, aryl, heterocyclyl, monosaccharide or disaccharide, and -CONR12R13 wherein J&u 
and R13 are independently selected from hydrogen, loweralkyl, aryl, heterocyclyl, saccharide, 
peptide and amino acid residues. 

10 

For the compounds described herein: ' 

The term "loweralkyl" refers to branched or straight chain acyclical alkyl groups 
comprising pne to ten carbon atoms, including, e.g., methyl, ethyl, propyl, isopropyl, n^butyl, 
t-butyl, neopentyl and the like. 

15 The term "alkyl" refers to alkyl groups that do not contain heteroatoms. Thus the term 

includes straight chain alkyl groups such as methyl, ethyl, propyl, butyl, pentyl, hexyl, heptyl, 
octyl, nonyl, decyl, undecyl, dodecyl and the like. The phrase also includes branched chain 
isomers of straight chain alkyl groups, including but not limited to, the following that are 
provided by way of example: -CH(CH 3 )2, <H(GH 3 )(CH2CH 3 ), -CH(CH 2 GH 3 )2, - 

10 C(CH 3 )3, -C(CH 2 CH 3 )3, -CH 2 CH(CH 3 )2, -CH 2 CH(CH 3 )(CH 2 CH 3 ), - 
CH 2 CH(CH 2 CH 3 )2, -CH 2 C(CK 3 h, -CH 2 C(CH 2 CH 3 ) 3 , - 

CH(CH 3 )CH(CH 3 )(CH 2 CH 3 ), -CH^CHCCH^, -OT 2 CH 2 CH(CH 3 )(CH 2 CH 3 ), - 
CH 2 CH 2 CH(CH 2 CH 3 )2, -CH 2 CH 2 C(CH 3 )3, -CHzCHzCC^CH^^ 
CH(CH 3 )CH 2 CH(CH 3 )2, -CH(CH 3 )CH(CH 3 )CH(CH 3 )2, - 

5 CH(CH 2 CH 3 )CT(^ and others. The term also includes cyclic alkyl 

groups such as cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl, and cyclooctyl and 
such rings substituted with straight and branched chain alkyl groups as defined above. The term 
also includes polycyclic alkyl groups such as, but not limited to, adamantyl norbornyl, and 
bicyclo[2.2.2]octyl and such rings substituted with straight and branched chain alkyl groups as 

0 defined above. Thus, the phrase unsubstituted alkyl groups includes primary alkyl groups, 

secondary alkyl groups, and tertiary alkyl groups. Unsubstituted alkyl groups may be bonded to 
one or more carbon atom(s), oxygen atom(s), nitrogen atom(s), and/or sulfur atom(s) in the 
parent compound. Preferred unsubstituted alkyl groups include straight and branched chain alkyl 
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groups and cyclic alkyl groups having 1 to 20 carbon atoms. More preferred such unsubstituted 
alkyl groups have from 1 to 10 carbon atoms while even more preferred such groups have from 1 
to 5 carbon atoms. Most preferred unsubstituted alkyl groups include straight and branched 
chain alkyl groups having from 1 to 3 carbon atoms and include methyl, ethyl, propyl, and - 
CH(CH 3 )2. 

Hie phrase "substituted alkyl" refers to an unsubstituted alkyl group as defined above in 
which one or more bonds to a carbon(s) or hydrogen(s) are replaced by a bond to non-hydrogen 
and non-carbon atoms such as, but not limited to, a halogen atom in halides such as F, CI, Br, 
and I; a phosphorus atom in groups such as phosphate and dialkyl alkylphosphonate; oxygen 
atom in groups such as hydroxyl groups, alkoxy groups, aryloxy groups, and ester groups; a 
sulfur atom in groups such as thiol groups, alkyl and aryl sulfide groups, sulfone groups, sulfonyl 
groups, and sulfoxide groups; a nitrogen atom in groups such as amines, amides, alkylamines, 
dialkylamines, arylamines, alkylarylamines, diarylamines, N-oxides, imides, and enamines; a 
silicon atom in groups such as in trialkylsilyl groups, dialkylarylsilyl groups, alkyldiarylsilyl 
groups, and triarylsilyl groups; and other heteroatoms in various other groups. Substituted alkyl 
groups also include groups in which one or more bonds to a carbon(s) or hydrogen(s) atom is 
replaced by a bond to a heteroatom such as oxygen in carbonyl, carboxyl, and ester groups; 
nitrogen in groups such as imines, oximes, hydrazones, and nitriles. Preferred substituted alkyl 
groups include, among others, alkyl groups in which one or more bonds to a carbon or hydrogen 
atom is/are replaced by one or more bonds to fluorine atoms. One example of a substituted alkyl 
group is the trifluoromethyl group and other alkyl groups that contain the trifluoromethyl group. 
Other alkyl groups include those in which one or more bonds to a carbon or hydrogen atom is 
replaced by a bond to an oxygen atom such that the substituted alkyl group contains a hydroxyl, 
alkoxy, aryloxy group, or heterocyclyloxy group. Still other alkyl groups include alkyl groups 
that have an amine, alkylamine, dialkylamine, arylamine, (alkyl)(aryl)amine,diarylamine, 
heterocyclylamine, (alkyl)(heterocyclyl)amine, (aryl)(heterocyclyl)amine, or 
diheterocyclylamine group. 

The term "alkoxy" refers to RO- wherein R, for example, is alkyl such as loweralkyl 
defined above. Representative examples of loweralkyl alkoxy groups include methoxy, ethoxy, 
t-butoxy and the like. 

The phrase "substituted alkoxy" refers to RO-, where R is, for example, an alkyl 
substituted, for example, with a halogen. RO is for example OCF 3 . 
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The term "alkenyl" refers to a branched or straight chain groups comprising two to 
twenty carbon atoms that also comprises one or more carbon-carbon double bonds. 
Representative alkenyl groups include prenyl, 2-propenyl (i.e., allyl), 3-methyl-2-butenyl, 
3,7-dimethyl-2,6-octadienyl, 4,8-dimethyl-3,7-nonadienyl, 3,741-trimethyl-2,6,10-dodecatrienyl 
5 and the like. 

The phrase "substituted alkenyl" refers to alkenyl groups that are substituted, for 
example, diethyl hex-5-enylphosponate, and others with an alkyl or substituted alkyl group such 
as di alkyl phosphate or an ester such as an acetate ester. 

Hie phrase "dialkyl amino" refers to an amino group substituted with two alkyl groups 
10 such as Cl-20 alkyl groups. 

The phrase "substituted dialkyl amino" refers to a dialkylamino substituted, for example, 
with a carboxylic acid, ester, hydroxy or alkoxy. 

The term "hydroxyalkylthio" refers to a thio radical to which is appended a hydroxyalkyl 
group, where the alkyl is for example lower alkyl. An example is hydroxyethylthio, - 
15 SCH2CH2OH. 

. The term 'TNT-alkylsulfonamide" refers to the group -SOzNHalkyl, where alkyl is, for 
example, octyl. 

The term "alkynyl" refers to a branched or straight chain comprising two to twenty 
carbon atoms that also comprises one or more carbon-carbon triple bonds. Representative 

10 alkynyl groups include ethynyl, 2-propynyl (propargyl), 1-propynyl and the like. 

The term "aryl" refers to aryl groups that do not contain heteroatoms. Thus the term 
includes, but is not limited to, groups such as phenyl, biphenyl, anthracenyl, naphthenyl by way 
of example. Although the phrase "unsubstituted aryl" includes groups containing condensed 
rings such as naphthalene, it does not include aryl groups that have other groups such as alkyl or 

15 halo groups bonded to one of the ring members, as aryl groups such as tolyl are considered 
herein to be substituted aryl groups as described below. A preferred unsubstituted aryl group is 
phenyl. Unsubstituted aryl groups may be bonded to one or more carbon atom(s), oxygen 
atom(s), nitrogen atom(s), and/or sulfur atom(s) in the parent compound, however. 

The phrase "substituted aryl group" has the same meaning with respect to aryl groups that 

10 substituted alkyl groups had with respect to alkyl groups. However, a substituted aryl group also 
includes aryl groups in which one of the aromatic carbons is bonded to one of the non-carbon or 
non-hydrogen atoms described above and also includes aryl groups in which one or more 
aromatic carbons of the aryl group is bonded to a substituted and/or unsubstituted alkyl, alkenyl, 
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or alkynyl group as defined herein. This includes bonding arrangements in which two carbon 
atoms of an aryl group are bonded to two atoms of an alkyl, alkenyl, or alkynyl group to define a 
fused ring system (e.g. dihydronaphthyl or tetrahydronaphthyl). Thus, the phrase "substituted 
aryl" includes, but is not limited to tolyl, and hydroxyphenyl among others. 

5 The term "arylalkyl" refers to a loweralkyl radical to which is appended an aryl group. 

Representative arylalkyl groups include benzyl, phenylethyl, hydroxybenzyl, fluorobenzyl, 
fluorophenylethyl and the like. 

The phrase "unfused arylaryl" refers to a group or substituent to which two aryl groups, 
that are not condensed to each other, are bound. Exemplary unfused arylaryl compounds include, 

10 for example, phenylbenzene, diphenyldiazene, 4-metoyltmo-l-phenylbenzene, phenoxybenzene, 
(2-phenylethynyl)benzene, diphenyl ketone, (4-phenylbuta-13-diynyl)benzene, 
phenylbenzylamine, (phenylmethoxy)benzene, and the like. Preferred substituted unfused 
arylaryl groups include: 2-(phenylarnmo)-N-[4^2-phenylethynyl)phenyl]acetamide, 1,4- 
diphenylbenzene, N-[4-(2-phenylethynyl)phenyl]-2-[ben^lammo]acetamide, 2-amino-N-[4-(2- 

5 phenylethynyl)phenyl]propanamide, 2-anmio-N-[4^2-phenylethynyl)phenyl]acetamide, 2- 
(cyclopropylainmo)-N-[4^2-phenylemynyl)phenyl]acetannde, 2-(ethylamino)-N-[4-(2- 
phenylethynyl)phenyl]acetamide,2-[(2-methylpropyl)amino]-N-[4-(2- 

phenylefoynyl)phenyl]a(*tannde,^ : 
phenylbenzene, 2-[(imidazolylmetoyl)ammo]-N-[4^2-phenylemyn^^ 
0 phenyl-l-phenoxybenzene, N^2-anmioethyl)[4-(2-phenyletoynyl)phenyl]carboxamide, 2-{[(4- .» 
fluorophenyl)metoyl]aiiHno}-N-[^^ 
metoylphenyl)methyl]annno}-N-^ 

(trifluoromethyl)benzene, l-butyl-4-phenylbenzene, 2-(cyclohexylamino>N-[4-(2- 
phenylethynyl)phenyl]acetamide, 2-(ethylmethylamino)-N-[4-(2- 
5 phenylethynyl)phenyl]acetamide, 2^utylamino>N-[4-(2-phenylemynyl)phenyl]acetamide,N- 
[4-(2-phenylemynyl)phenyl]-2-(4-pyrid^^ 
(qutouchdm-3-ylamino)acetamide^^ 
2-animc-3-mefoyl-N-[4^2-pheny^ 

diynyl)phenylamine, 2-(oUmethylammo)-N-[4-(4-pheny^ 2- 
) (ethylammo)-N-[4-(4-phenylbu^ !_[4_ 
(2-phenyleftynyl)phenyl]etoan-l-one,N-(l^^ 
diynyl)phenyl]carboxamide,N-[4-(2-phenyletoynyl)phenyl]propanami^ 
phenyl ketone, phenyl-N-benzamide, (tert-butoxy>N-[(4-phenylphenyl)methyl]carboxamide, 2- 
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(3-phenylphenoxy)ethanehydroxamic acid, 3-phenylphenyl propanoate, l-(4-eihoxyphenyl>4- 
methoxybenzene, and [4-{2-phen3dethynyl)phenyl]pynole. 

Hie phrase "unfused heteroarylaryl" refers to a unfused arylaryl group where one of the 
aryl groups is a heteroaryl group. Exemplary heteroarylaryl groups include, for example, 2- 

5 phenylpyridine, phenylpyrrole, 3-(2-phenylethynyl)pyridine, phenylpyrazole, 5-(2- 
phenylethynyl)-13-^ydropyrimidine-2,4-dione, 4-phenyl-l,2,3-thiadiazole, 2-(2- 
phenylethynyl)pyrazine, 2-phenylthiophene, phenylimidazole, 3-(2-piperazinylphenyl)furan, 3- 
(2,4-dichlorophenyl)-4-methylpyrrole, and the like. Preferred substituted unfused heteroarylaryl 
groups include: 5^2-phenylethynyl)pyrimidine-2-ylamine, 1 -methoxy-4-(2-thienyl)benzene, 1- 

10 methoxy-3-(2-thienyl)benzene, 5-methyl-2-phenylpyridine, 5-methyl-3-phenylisoxazole, 2-[3- 
(trifluoromethyl)phenyl]furan, 3-fluoro-5^2-furyl)-2-methoxy-l-prop-2-enylbenzene, 
(hydroxyimino)(5-phenyl(2-thienyl))methane, 5-[(4-methylpiperazinyl)methyl]-2- 
phenylthiophene, 2-(4-ethylphenyl)thiophene, 4-methylthio-l-(2-thienyl)benzene, 2-(3^ 
nitrophenyl)thiophene, (tert-butoxy>N-[(5-phenyl(3-pyridyl))methyl]caiboxamide, hydrdxy-N- 

15 [(5-phenyl(3-pyridyl))methyl]amide, 2-(phenylmethylthio)pyridine, and benzylimidazole. 

The phrase "unfused heteroarylheteroaryl" refers to an unfused arylaryl group where both 
of the aryl groups is a heteroaryl group. Exemplary heteroarylheteroaryl groups include, for 
example, 3-pyridylimidazole, 2-imidazolylpyrazine, and the like. Preferred substituted unfused 
heteroarylheteroaryl groups include: 2-(4-piperazinyl-3-pyridyl)furan, diethyl(3-pyrazin-2-yl(4- 

10 pyridyl))amine, and dimethyl{2-[2-(5-methylpyrazin-2-yl)ethynyl](4-pyri 

i The phrase "fused arylaryl" refers to an aryl group as previously defined that is 
condensed, and fully conjugated to an aryl group. Representative fused arylaryl groups include 
biphenyl, 4~(l-naphthyl)phenyl, 4-(2-naphthyl)phenyl and the like. 

The phrase "fused heteroarylaryl" refers to an aryl group as previously defined that is 

i5 condensed, and fully conjugated with a heteroaryl group. Representative fused heteroarylaryl 
groups include quinoline, quinazoline and the like. 

Hie phrase "fused heteroarylheteroaryl" refers to a heteroaryl group as previously defined 
that is condensed, and fully conjugated with another heteroaryl group. Representative fused 
heteroarylheteroaryl groups include pyrazalopyrimidine, imidazoquinoline and the like. 

\0 Hie term "aryloxy" refers to RO- wherein R is an aryl group. Representative arylalkoxy 

group include benzyloxy, phenylethoxy and the like. 

Hie term "arylalkoxy" refers to a lower alkoxy radical to which is appended an aryl 
group. Representative arylalkoxy group include benzyloxy, phenylethoxy and the like. 
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The term "aryloxyaryl" refers to an aryl radical to which is appended an aryloxy group. 
Representative aryloxyaryl groups include 4-phenoxyphenyl, 3-phenoxyphenyl, 
4-phenoxy-l-naphthyl, 3-phenoxy-l-naphthyl and the like. 

The tenn "aryloxyarylalkyl" refers to an arylalkyl radical to which is appended an 
aryloxy group. Representative aryloxyarylalkyl groups include 4-phenoxyphenylmethyl, 
3-phenoxyphenylmethyl, 4-phenoxyphenylethyl, 3-phenoxy-phenylethyl and the like. 

The term "arylalkoxyaryl" refers to an aryl radical to which is appended an arylalkoxy 
group. Representative arylalkoxyaryl groups include 4-benzyloxylphenyl, 3-benzyloxyphenyl 
and the like. 

The term "arylalkoxyarylalkyl" refers to an arylalkyl radical to which is appended an 
arylalkoxy group. Representative arylalkoxyarylalkyl groups include 4-benzyloxylbenzyl, 
3-benzyloxybenzyl and the like. 

The term "cycloalkyl" refers to an alicyclic group comprising from 3 to 7 carbon atoms 
including, but not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl and the like. 

The term "cycloalkylalkyl" refers to a loweralkyl radical to which is appended a 
cycloalkyl group. Representative examples of cycloalkylalkyl include cyclopropylmethyl, 
cyclohexylmethyl, 2-(cyclopropyl)ethyl and the like. 

The term "halogen" refers to iodine, bromine, chlorine or fluorine; "halo" refers to iodo, 
bromo, chloro or fluoro. ■ 

The term "haloalkyl" refers to a lower alkyl radical, as defined above, bearing at least one 
halogen substituent, for example, chloromethyl, fluoroethyl or trifluoromethyl and the like. 

The term "heterocyclyr (or heterocyclic, or heterocyclo) refers to both aromatic and 
nonaromatic ring compounds including monocyclic, bicyclic, and polycyclic ring compounds 
such as, but not limited to, quinuclidyl, containing 3 or more ring members of which one or more 
is a heteroatom such as, but not limited to, N, O, and S. Although the phrase "unsubstituted 
heterocyclyr includes condensed heterocyclic rings such as benzimidazolyl, it does not include 
heterocyclyl groups that have other groups such as alkyl or halo groups bonded to one of the ring 
members as compounds such as 2-methylbenzdmidazolyl are substituted heterocyclyl groups. 
Examples of heterocyclyl groups include, but are not limited to: unsaturated 3 to 8 membered 
rings containing 1 to 4 nitrogen atoms such as, but not limited to pyrrolyl, pyrrolinyl, imidazolyl, 
pyrazolyl, pyridyl, dihydropyridyl, pyrimidyl, pyrazinyl, pyridazinyl, triazolyl (e.g. 4H-1 ,2,4- 
triazolyl, lH-l,2,3-triazolyl, 2H-l,2,3-triazolyl etc.), tetrazolyl, (e.g. lH-tetrazolyl, 2H 
tetrazolyl, etc.); saturated 3 to 8 membered rings containing 1 to 4 nitrogen atoms such as, but 
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not limited to, pyrrolidinyl, imidazolidinyl, piperidinyl, piperazinyl; condensed unsaturated 
heterocyclic groups containing 1 to 4 nitrogen atoms such as, but not limited to, indolyl, 
isoindolyl, indolinyl, indolizinyl, benzimidazolyl, quinolyl, isoquinolyl, indazolyl, 
benzotriazolyl; unsaturated 3 to 8 membeied rings containing 1 to 2 oxygen atoms and 1 to 3 

5 nitrogen atoms such as, but not limited to, oxazolyl, isoxazolyl, oxadiazolyl (e.g. 1,2,4- 
oxadiazolyl, 1,3,4-oxadiazolyl, 1,2,5-oxadiazolyl, etc.); saturated 3 to 8 membered rings 
containing 1 to 2oxygen atoms and 1 to 3 nitrogen atoms such as, but not limited to, 
morpholinyl; unsaturated condensed heterocyclic groups containing 1 to 2 oxygen atoms and 1 to 
3 nitrogen atoms, for example, benzoxazolyl, benzoxadiazolyl, benzoxazdnyl (e.g. 2H-1,4- 

10 benzoxazinyl etc.); unsaturated 3 to 8 membered rings containing 1 to 3 sulfur atoms and 1 to 3 
nitrogen atoms such as, but not limited to, thiazolyl, isothiazolyl, thiadiazolyl (e.g. 1,2,3- 
thiadiazolyl, 1,2,4-thiadiazolyl, 1,3,4-thiadiazolyl, 1,2,5-thiadiazolyl, etc.); saturated 3 to 8 
membered rings containing 1 to 2 sulfur atoms and 1 to 3 nitrogen atoms such as, but not limited 
to, thiazolodinyl; saturated and unsaturated 3 to 8 membered rings containing 1 to 2 sulfur atoms 

15 such as, but not limited to, thienyl, dihydrodithiinyl, dihydrodithionyl, tetrahydrothiophene, 

tetrahydrothiopyran; unsaturated condensed heterocyclic rings containing 1 to 2 sulfur atoms and 
1 to 3 nitrogen atoms such as, but not limited to, benzothiazolyl, benzothiadiazolyl, 
benzothiazinyl (e.g. 2H-l,4-benzothi&zinyl, etc.), dihydrobenzothiazinyl (e.g. 2H-3,4- 
dihydrobenzothiazinyl, etc.), unsaturated 3 to 8 membered rings containing oxygen atoms such 

20 as, but not limited to furyl; unsaturated condensed heterocyclic rings containing 1 to 2 oxygen 
atoms such as benzodioxolyl (e.g. 1,3-benzodioxoyl, etc.); unsaturated 3 to 8 membered rings 
containing an oxygen atom and 1 to 2 sulfur atoms such as, but not limited to, dihydrooxathiinyl; 
saturated 3 to 8 membered rings containing 1 to 2 oxygen atoms and 1 to 2 sulfur atoms such as 
1,4-oxathiane; unsaturated condensed rings containing 1 to 2 sulfur atoms such as benzothienyl, 

15 benzodithiinyl; and unsaturated condensed heterocyclic rings containing an oxygen atom and 1 
to 2 oxygen atoms such as benzoxathiinyL Heterocyclyl group also include those described 
above in which one or more S atoms in the ling is double-bonded to one or two oxygen atoms 
(sulfoxides and sulfones). For example, heterocyclyl groups include tetrahydrothiophene, 
tetrahydrothiophene oxide, and tetrahydrothiophene 1,1-dioxide. Preferred heterocyclyl groups 

50 contain 5 or 6 ring members. More preferred heterocyclyl groups include morpholine, 

piperazine, piperidine, pyrrolidine, imidazole, pyrazole, 1,2,3-triazole, 1,2,4-triazole, tetrazole, 
thiomorpholine, thiomorpholine in which the S atom of the thiomorpholine is bonded to one or 
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more O atoms, pyrrole, homopiperazine, oxazolidin-2-one, pyrrolidin-2-one, oxazole, 
quinuclidine, thiazole, isoxazole, furan, and tetrahydrofuran. 

Hie phrase "substituted heterocyclyl" refers to an heterocyclyl group as defined above in 
which one of die ring members is bonded to a non-hydrogen atom such as described above with 
5 respect to substituted alkyl groups and substituted aryl groups. Examples, include, but are not 
limited to, 2-methylbenzimidazolyl, 5-methylbenzimidazolyl, 5-chlorobenzthiazolyl, 1-methyl 
piperazinyl, and 2-chloropyridyl among others. 

"Aminosulfonyr refers to the group -S(0) 2-NBfe. "Substituted aminosulfonyP refers to 
the group -S(O) 2-NRR' where R is loweralkyl and R' is hydrogen or a loweralkyl. The term 
10 "aralkylaminosulf onlyaryl" refers to the group -aryl-S(O) 2-NH-aralkyl, whore the aralkyl is 
loweraralkyl. 

"Carbonyl" refers to the divalent group -C(O)-. 

"Carbonyloxy" refers generally to the group -C(0)-0-,. Such groups include 
esters; -C(0)-0-R, where R is loweralkyl, cycloalkyl, aryl, or loweraralkyl. Hie term 

15 "carbonyloxycycloalkyl" refers generally to both an "carbonyloxycarbocycloalkyl" arid an 
"carbonyloxyheterocycloalkyl", i.e., where R is a carbocycloalkyl or heterocycloalkyl,* 
respectively. The term "arylcarbonyloxy" refers to the group -C(0)-0-aryl, where aryl is a 
mono- or polycyclic, carbocycloaryl or heterocycloaryl. Hie term "aralkylcarbonyloxy" refers to 
the group -C(0>0-aralkyl, where the aralkyl is loweraralkyl. 

20 -.: The term "sulfonyl" refers to the group -SO2-. "Alkylsulfonyl" refers to a substituted 
sulfonyl of the structure -S0 2 R - in which R is alkyl. Alkylsulfonyl groups employed in 
compounds of the present invention are typically loweralkylsulfonyl groups having from 1 to 6 • • 
carbon atoms in its backbone structure. Thus* typical alkylsulfonyl groups employed in 
compounds of the present invention include, for example, methylsulfonyl (i.e., where R is 

25 methyl), ethylsulfonyl (i.e., where R is ethyl), propylsulfonyl (i.e., where R is propyl), and the 
like.- The term "arylsulfonyl" refersto the group -SC^-aryl. The term "aralkylsulfonyl" refers to 
o the group -SCVaralkyl, in which the aralkyl is loweraralkyl. The term "sulfonamido" refers to - 
SO2NH2. 

The term "carbonylamino" refers to the divalent group -NH-C(O)- in which the hydrogen 
30 atom of the amide nitrogen of the carbonylamino group can be replaced a loweralkyl, aryl, or 
loweraralkyl group. Such groups include moieties such as carbamate esters (-NH-C(O)-O-R) and 
amides -NH-C(0)-O-R, where R is a straight or branched chain loweralkyl, cycloalkyl, or aryl 
or loweraralkyl. The term "loweralkylcaAonylamino" refers to alkylcarbonylamino where R is a 
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loweraDcyl having from 1 to about 6 carbon atoms in its backbone structure. The term 
"arylcarbonylamino" refers to group -^NH-C(0)-R where R is an aryl. Similarly, the term 
"aralkylcarbonylamino n refers to carbonylamino where R is a lower aralkyl. 

The term "guanidino" or "guanidyl" refers to moieties derived from guanidine, 

5 H2N-C(=NH>NH2. Such moieties include those bonded at the nitrogen atom carrying the formal 
double bond (the rt 2"-position of the guanidine, e.g., diaminomethyleneamino, (H2N>2C=NH-) 
and those bonded at either of the nitrogen atoms carrying a formal single bond (the " 1-" and/or 
n 3"-positions of the guandine, e.g., H;aN-C(=NH)-NH-). Hie hydrogen atoms at any of the 
nitrogens can be replaced with a suitable substituent, such as loweralkyl, aryl, or loweraralkyl. 

10 Representative cycloimido and heterocycloimido groups include, for example, those 

shown below. These cycloimido and heterocycloimido can be further substituted and may be 
attached at various positions as will be apparent to those having skill in the organic and 
medicinal chemistry arts in conjunction with the disclosure herein. 



L5 




Representative substituted amidino and heterocycloamidino groups include, for example, 
10 those shown below. These amidino and heterocycloamidino groups can be further substituted as 
will be apparent to those having skill in the organic and medicinal chemistry arts in conjunction 
with the disclosure herein. 
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and H 

Representative substituted alkylcarbonylamino, alkyloxycarbonylamino, 
aminoalkyloxycarbonylamino, and arylcarbonylamino groups include, for example, those shown 
below. These groups can be further substituted as will be apparent to those having skill in the 
organic and medicinal chemistry arts in conjunction with the disclosure herein. 

> i 

NlH H ^NH | ^NH NH | I 

' ' ' ' 




and oh 



Representative substituted aminocarbonyl groups include, for example, those shown 
below. These can heterocyclo groups be further substituted as will be apparent to those having 
skill in the organic and medicinal chemistry arts in conjunction with the disclosure herein. 

X v v r 




Representative substituted alkoxycarbonyl groups include, for example, those shown 
below. These alkoxycarbonyl groups can be further substituted as will be apparent to those 
having skill in the organic and medicinal chemistry arts in conjunction with the disclosure 
herein. 
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5 "Substituted" refers to the definite replacement of hydrogen with one or more 

monovalent or divalent radicals. Suitable substitution groups include, those described herein for 
particular groups, as well as hydroxyl, nitro, amino, imino, cyano, halo, thio, thioamido,. 
amidino, imidino, oxo, oxamidino, methoxamidino, imidino, guanidino, sulf onamido, carboxyl, 
formyl, alkyl, substituted alkyl, haloloweralkyl, loweralkoxy, haloloweralkoxy, loweralkoxy- 

10 alkyl, alkylcarbonyl, arylcarbonyl, aralkylcarbonyl, heteroarylcaibonyl, heteroaralkylcarbonyl, 
alkylthio, aminoalkyl, cyanoalkyl, benzyl, pyridyl, pyrazolyl, pyrrole, thiophene, imidazolyl, and 
the like. 

The term "linking moiety" refers to a covalent bond or an uncyclized divalent group, 
such as, for example, -CO-, -O-, -S-, -CHr, -NH-, and substituted or unsubstituted alkyl, 

15 alkenyl, alkynyl, carbonyl, alkoxycarbonyl groups as defined herein. 

The term "SMD? compound" refers to small molecule immunopotentiating compounds, 
that include small molecule compounds below about MW 1000 g/mol, preferably MW 800 
g/mol that are capable of stimulating or modulating a pro-inflammatory response in a patient In 
an embodiment, the SMDP compounds are able to stimulate human peripheral blood mononuclear 

iO cells to produce cytokines. Preferred SMIP compounds and derivatives thereof include, for 
example, aminoazavinyl compounds, benzazole compounds, acylpiperazine compounds; 
indoledione compounds, tetrahydroisoquinoline (THIQ) compounds, anthraquinone compounds, 
indanedione compounds, pthalimide compounds, benzocyclodione compounds, 
aminobenzimidazole quinolinone (ABIQ) compounds, hydraphthalimide compounds, 

15 pyrazolopyrimidine compounds, quinazilinone compounds, quinoxaline compounds, triazine 
compounds, tetrahydropyrrolidinoquinoxaline compounds, pyrrole compounds, benzophenone 
compounds, sterol compound, and isoxazole compounds. 
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Hie term "SMIS compound" refers to small molecule immunosuppressant compounds, 
that include small molecule compounds below about about MW 1000 g/mol, preferably MW 800 
g/mol, capable of suppressing or modulating a pro-inflammatory response in a patient 

Acylpiperazine compounds as described throughout this application include compounds 
of formula (HI) as shown below: 

D ^D 2 



: * ~(Rl0)n 

in 

wherein, 

R 9 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 
arylalkyl, arylalkenyl, heteroarylalkyl, and heteroarylalkenyl; 
Rio is substituted or unsubstituted alkyl; 
n is an integer from 0-2; and. 
if Di is carbon than D2 is oxygen, D 3 is absent, and D4 is selected from the group consisting 

of substituted or unsubstituted aryl, heteroaryl, carbocycyl, alkoxyaryl, fused arylaryl, fused 

arylheteroaryl, and fused heteroarylaryl; or, 

if D\ is nitrogen than D2 is nitrogen, D4 is absent, and D3 is selected from the group 
• consisting of substituted or unsubstituted aryl, heteroaryl, carbocycyl, alkoxyaryl, fused 
arylaryl fused arylheteroaryl, and fused heteroarylaryl. 

Indoledione compounds as described throughout this application include compounds of 
formula (TV) as shown below: 




wherein, 
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Rn and R 12 are independently selected from the group consisting of H, nitro, halogen, 
amino, hydroxy, cyano, carboxcyclic acid, and substituted or unsubstituted alkyl, aryl, 
heteroaryl, alkoxy, alkylcarbonyl, alkylcarbonylamino, alkylaminocaibonyl, 
aminocaibonyl, arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 
heteroarylamino, heteroarylaminoalkyl, heterocyclyl, heteiocyclylalkoxy, 
hetetocyclylalkyl, and caibocyclyl groups; and, 

R13 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 
aiylalkyl, heteroarylalkyl, heterocyclyl, heterocyclylalkyl, and alkylbenzyl. 

Tetrahydroisoquinoline (THIQ) compounds as described throughout this application 
include compounds of formula (V) as shown below: 

F*14 




V 

wherein, 

L is a covalent bond or selected from the group consisting of -CH 2 -, -CO-, -O-, -S-, CHF, 
-NH-, -NR20-, where R20 is lower alkyl; 

R14 is selected from the group consisting of hydrogen, halogen, and substituted or 
unsubstituted alkyl; 

R15 is selected from the group consisting of substituted or unsubstituted carbocyclyl, aryl, 
arylalkyl, alkoxyaryl, heteroaryl, heterocyclyl; 

Ri6 is selected from the group consisting of hydrogen, halogen, and substituted or 
, unsubstituted alkyl; 

Rn is selected from the group consisting of hydrogen, halogen, and substituted or 
unsubstituted alkyl; and, 

Rig and R19 are independently selected from the group consisting of H, hydroxy, halogen, 
alkoxy, amino, unsubstituted alkyl, substituted alkyl, and alkylamino. 

Benzocyclodione compounds as described throughout this application include 
compounds of formula (VI) as shown below: 
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O 



l 24 



VI 



wheiein, 



E is selected from the group consisting of NR25 or CR26R27; 

R21, R23, and R24 are independently selected from the group consisting of H, hydroxy, 

halogen, alkoxy, amino, unsubstituted alkyl, substituted alkyl, and alkylamino; 

R22 is selected from the group consisting or H, hydroxy, halogen, alkoxy, amino, and 

unsubstituted or substituted alkyl, and alkylamino, arylalkyl, heteroarylalkyl, aryl, 

heteroaryl, arylcarbonyl, heterocyclyl, heterocyclylalkyl, and heteroarylcarbonyl; 

R25 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 

heterocyclyl, carbocyclyl, arylalkyl, heteroarylalkyl, and heterocyclyalkyl; 

R26 is selected from the group consisting of H, halogen, hydroxy, amino, and substituted 

or unsubstituted alkyl, carbonylalkyl, and alkylcarbonylalkyl; and, 

R27 is selected from the group aryl, arylalkyl, heteroarylalkyl, heterocyclyl, 

heterocyclylalkyl, carbocyclyl, arylcarbonylalkyl, and arylalkylcarbonyl. 

Aminoazavinyl compounds as described throughout this application include compounds 
of formula (VII) as shown below: 



G is either S or NH; 

R28 is selected from the group consisting of H, and substituted or unsubstituted alkyl, 
aryl, heteroaryl, heteroarylalkyl, arylalkyl, carbocyclyl, carbocyclylalkyl, heterocyclyl, 
and heterocyclylalkyl; 

Q is selected from the group consisting of hydrogen, substituted alkyl, unsubstituted alkyl, 
and aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclyl, substituted 



G 




vn 



wherein, 
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heterocyclyl, fused or unfused arylaryl, substituted arylaryl, arylheteroaryl, substituted 
arylheteroaryl, heteroarylheteroaryl, and substituted heteroarylheteroaryl; 

Vi is selected from the group consisting of alkyl, substituted alkyl, aryl, substituted aryl, 
arylalkyl, substituted arylalkyl, heteroaryl, substituted heteroaryl, heteroaiylalkyl, 

5 substituted heteroarylalkyl, alkoxy, substituted alkoxy, aminocarbonyl, alkoxycaibonyl, 

carboxyl sulfonyl, methanesulfonyl, and substituted or unsubstituted alkylcaibonyl, 
arylcarbonyl, aralkylcaibonyl, heteroaiylcarbonyl, heteroaralkylcarbonyl, 
alkylcarbonyloxy, arylcarbonyloxy, aralkylcarbonyloxy, heteroarylcarbonyloxy, 
heteroaralkylcaibonyloxy, alkylaminocarbonyloxy, arylaminocaibonyloxy, formyl, 

10 loweralkylcarbonyl, loweralkoxycaibonyl, aminocarbonyl, aminoaryl, alkylsulfonyl, 

sulfonamido, aminoalkoxy, alkylamino, heteroaryl amino , alkylcarbonylamino, 
alkylamincrcaibonylamino, aiylaminocaibonylamino, aralkylcarbonylamino, 
heteroarylcaibonylamino, arylcarbonylamino, cycloamidino, cycloalkyl, cycloimido, 
arylsulfonyl and arylsulfonamido; and, 

15 V 2 is selected from the group consisting of hydrodgen, halogen, alkyl, substituted alkyl, 

aryl, substituted aryl, arylalkyl, substituted arylalkyl, heteroaryl, substituted heteroaryl, 
heteroarylalkyl, substituted heteroarylalkyl, alkoxy, substituted alkoxy, aminocarbonyl, 
alkoxycarbonyl, carboxyl sulfonyl, methanesulfonyl, and substituted or unsubstituted 
alkylcaibonyl, arylcarbonyl, aralkylcarbonyl, heteroarylcarbonyl, heteroaralkylcarbonyl, 

50 alkylcarbonyloxy, arylcaibonyloxy, aralkylcarbonyloxy, heteroarylcarbonyloxy, 

heteroaralkylcaibonyloxy, alkylaminocaibonyloxy, arylaminocarbonyloxy, formyl, 
loweralkylcaibonyl, loweralkoxycaibonyl, aminocaibonyl, aminoaryl, alkylsulfonyl, 
sulfonamido, ajninoalkoxy, alkylamino, heteroarylamino, alkylcarbonylamino, 
alkylaminocarbonylamino, arylaminocarbonylamino, aralkylcarbonylamino, 

55 heteroarylcarbonylamino, arylcaibonylamino, cycloamidino, cycloalkyl, cycloimido, 

arylsulfonyl and arylsulfonamido. 

Lactam compounds as described throughout this application include compounds of 
formula (VIII) as shown below: 
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R35 



vm 

wherein, 

Wi is selected from the group consisting of -OH, -OR36 groups, -NR37R38; 

W2 is selected from the group consisting of O, S, and NR39 groups; 

R29 and R30 join to form a 5 to 6 membeied substituted or unsubstituted ring comprising 

all carbon atoms or at least one O, N, or S atom; 

R35 and R39 may be the same or different and are selected from the group consisting of H, 
. -OH substituted and unsubstituted alkyl groups, substituted and unsubstituted aryl 
groups, -C(=0)H, -C(=0)-alkyl groups, and -C(=0>aryl groups; 
R31, R32, R33, and R34 may be the same or different and are independently selected from 
the group consisting of H, CI, Br, F, I, -NO2, -CN, -OH, -OR40 groups, -NR41R42 groups, 
-C(=0)R43 groups, -SH groups, substituted and unsubstituted amidinyl groups, 
substituted and unsubstituted guanidinyl groups, substituted and unsubstituted alkyl 
groups, substituted and unsubstituted aryl groups, substituted and unsubstituted alkenyl 
groups, substituted and unsubstituted alkynyl groups, substituted and unsubstituted 
heterocyclyl groups, substituted and unsubstituted alkylaminoalkyl groups, substituted 
and unsubstituted dialkylaminoalkyl groups, substituted and unsubstituted arylaminoalkyl 
groups, substituted and unsubstituted di arylaminoalkyl groups, substituted and 
unsubstituted (alkyl)(aryl)aminoalkyl groups, substituted and unsubstituted 
heterocyclylalkyl groups, substituted and unsubstituted aminoalkyl groups, substituted 
and unsubstituted heterocyclylaminoalkyl groups, substituted and unsubstituted 
(Hheterocyclylaminoalkyl groups, substituted and unsubstituted 
(alkyl)(heterocyclyl)aminoalkyl groups, substituted and unsubstituted 
(aryl)(heterocyclyl)aminoalkyl groups, substituted and unsubstituted hydroxyalkyl 
groups, substituted and unsubstituted alkoxyalkyl groups, substituted and unsubstituted 
aryloxyalkyl groups, and substituted and unsubstituted heterocyclyloxyalkyl groups; 
R36 is selected from the group consisting of substituted and unsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, substituted and unsubstituted heterocyclyl 
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groups, substituted and unsubstituted heterocyclylalkyl groups, -C(=0)H, -C(=0)-alkyl 
groups, -C(=0>aryl groups, -C(=0)0-alkyl groups, -C(=0)0-aryl groups, -C(=0)NH 2 , - 
C(=0)NB(a]kyl) groups, -C(=0)NH(aryl) groups, -C(=0)N(alkyl)2 groups, - 
C(=0)N(aryl)2 groups, -C(=0)N(alkyl)(aryl) groups, -NH 2 , -NH(alkyl) groups, -NH(aryl) 
groups, -N(alkyl)2 groups, -N(alkyl)(aryl) groups, -N(aryi)2 
groups, -C(=0)NH(heterocyclyl) groups, -C(=0)N(heterocyclyl)2 
groups, -C(=0)N(alkyl)(heterocyclyl) groups, and -C(=0)N(aryl)(heterocyclyl) groups; 
R37 is selected from the group consisting of H, substituted and unsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, and substituted and unsubstituted heterocyclyl 
groups; 

R38 is selected from the group consisting of H, substituted and unsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, substituted and unsubstituted heterocyclyl • 
groups, -OH, alkoxy groups, aryloxy groups, -NH2, substituted and unsubstituted 
heterocyclylalkyl groups, substituted and unsubstituted aminoalkyl groups, substituted 
and unsubstituted alkylaminoalkyl groups, substituted and unsubstituted 
dialkylaminoalkyl groups, substituted and unsubstituted arylaminoalkyl groups, 
substituted and unsubstituted di arylaminoalkyl groups, substituted and unsubstituted. 
(alkyl)(aryl)aminoalkyl groups, substituted and unsubstituted alkylamino groups, 
substituted and unsubstituted arylamino groups, substituted and unsubstituted 
dialkylamino groups, substituted and unsubstituted diarylamino groups, substituted and 
unsubstituted (alkyl)(aryl)amino groups, -C(=0)H, -C(=0)-alkyl groups, -C(=0)-aryl 
groups, -C(=0)0-alkyl groups, -C(=0)0-aryl groups, -C(=0)NH 2 , -C(=0)NH(alkyl) 
groups, -C(=0)NH(aryl) groups, -C(=0)N(alkyl)2 groups, -C(=0)N(aryl)2 
groups, -C(=0)N(alkyl)(aryl) groups, -C(=0)-heterocyclyl groups, -C(=0)-0- 
heterocyclyl groups, -C(=0)NH(heterocyclyl) groups, -C(=0)-N(heterocyclyl)2 groups, - 
C(=0>N(alkyl)(heterocyclyl) groups, -C(=O)-N(aryl)0ieterocyclyl) groups, substituted * 
and unsubstituted heterocyclylaminoalkyl groups, substituted and unsubstituted 
diheterocyclylaminoalkyl groups, substituted and unsubstituted 
(alkyl)(heterocyclyl)aminoalkyl groups, substituted and unsubstituted 
(aryl)(heterocyclyl)aminoalkyl groups, substituted and unsubstituted hydroxyalkyl 
groups, substituted and unsubstituted alkoxyalkyl groups, substituted and unsubstituted 
aryloxyalkyl groups, and substituted and unsubstituted heterocyclyloxyalkyl groups; 



-243- 



WO 2004/092360 



PCT/US2004/011710 



R41 is selected from the group consisting of H, substituted and unsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, and substituted and unsubstituted heterocyclyl 
groups; 

R42 is selected from the group consisting of H, substituted and unsubstituted alkyl groups, 
substituted and unsubstituted aryl groups, substituted and unsubstituted heterocyclyl 
groups, -C(=0)H, -C(=0)-alkyl groups, -C(=0>aryl groups, - 

C(=0)NH 2 , »C(=0)NH(alkyl) groups, -C(=0)NH(aryl) groups, -C(=0)N(alkyl)2 groups, 
-C(=0)N(aryl)2 groups, -C(=0)N(aIkyl)(aryl) groups, -C(=O)0-alkyl 
groups, -C(=0)0aryl groups, substituted and unsubstituted aminoalkyl groups, 
substituted and unsubstituted alkylaminoalkyl groups, substituted and unsubstituted 
dialkylaminoalkyl groups, substituted and unsubstituted aryl aminoalkyl groups, 
substituted and unsubstituted diarylaminoalkyl groups, substituted and unsubstituted 
(alkyl)(aryl)aminoalkyl groups, substituted and unsubstituted heterocyclylalkyl groups, - 
C(=0>heterocyclyl groups, -C(=0)-0-heterocyclyl groups, -C(=0)NH(heterocyclyl) 
groups, -C(=O)-N0ieterocyclyl)2 groups, -C(=0)-N(alkyl)(heterocyclyl) groups; -C(=0> 
N(aryl)(heterocyclyl) groups, substituted and unsubstituted heterocyclylaminoalkyl 
groups, substituted and unsubstituted diheterocyclylaminoalkyl groups, substituted and 
unsubstituted (heterocyclyl)(alkyl)aminoalkyl groups, substituted and unsubstituted 
(heterocyclyl)(aryl)aminoalkyl groups, substituted and unsubstituted hydroxyalkyl 
. groups, substituted and unsubstituted alkoxyalkyl groups, substituted and unsubstituted 
aryloxyalkyl groups, and substituted and unsubstituted heterocyclyloxyalkyl groups; and 
R43 is selected from the group consisting of H, -NH 2 , -NH(alkyl) groups, -NH(aryl) 
groups, -N(alkyl>2 groups, -N(aryl)2 groups, -N(alkyl)(aryl) groups, -NH(heterocyclyl) 
groups, -N(heterocyclyl)(alkyl) groups, -N(heterocyclyl)(aryl) groups, -N(heterocyclyl)2 
groups, substituted and unsubstituted alkyl groups, substituted and unsubstituted aryl 
groups, -OH, substituted and unsubstituted alkoxy groups, substituted and unsubstituted 
heterocyclyl groups, substituted and unsubstituted aryloxy groups, heterocyclyloxy 
groups, -NHOH, -N(alkyl)OH groups, -N(aryl)OH groups, -N(alkyl)0-alkyl groups, - 
N(aryl)Oalkyl groups, -N(alkyl)0-aryl groups, and -N(aryl)0-aryl groups. 

Preferably R29 and R30 join together to form a substituted or unsubstituted phenyl ring. 



-244- 



WO 2004/092360 



PCTYUS2004/011710 



Hydropthal amide compounds as described throughout this application include 
compounds of formula (DQ as shown below: 




51^5; 
IX 



wherein, 



R44 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 
arylalkyl, heteroarylalkyl, fused arylaryl, unfused arylaryl, fused heteroarylaryl, unfused 
heteroarylaryl, fused arylheteroaryl, and unfused arylheteroaryl; 
: R45, R47, R49, and R51 may be the same or different and arc independently selected from 
the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic acid, and 
substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, alkylcarbonyl, 
. alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, arylalkoxy, heteroarylalkoxy, 
alkylamino, arylalkylamino, arylamino, heteroarylamino, heteroarylaminoalkyl, 
heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and carbocyclyl; and 
R45, R48, R50, and R52 may be the same or different and are independently selected from 
the group consisting of H, halogen, and substituted or unsubstituted alkyl groups. 

■i 

Benzophenone compounds as described throughout this application include compounds 
of formula (X) as shown below: 



R53 is independently selected from the group consisting of H, nitro, halogen, amino, 
hydroxy, cyano, carboxcyclic acid, and substituted or unsubstituted alkyl, aryl, 
heteroaryl, alkoxy, alkylcarbonyl, alkylcarbonylamino, alkylaminocarbonyl, 
aminocarbonyl, arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 
heteroarylamino, heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, 
heterocyclylalkyl, and carbocyclyl; 




wherein, 
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R54 is independently selected from the group consisting of H, nitro, halogen, amino, 
hydroxy, cyano, caiboxcyclic acid, and substituted or unsubstituted alkyl, aryl, 
heteroaryl, alkoxy, alkylcarbonyl, alkylcarbonylamino, alkylaminocarbonyl, 
aminocarbonyl, arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 
heteroarylamino, heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, 
heterocyclylalfcyl, and carbocyclyl; and 
o and p are integers from 0-4. 

Isoxazole compounds as described throughout this application include compounds of 
formula (XI) as shown below: 




XL 

wherein, 

R55 is selected from the group consisting of substituted or unsubstituted aryl, arylalkyl, 

heteroaryl, heteroarylalkyl, heterocyclyl, and heterocyclylalkyl; 

R56 is selected from the group consisting of substituted or unsubstituted aryl, arylalkyl, 

heteroaryl, heteroarylalkyl, heterocyclyl, and heterocyclylalkyl; and, 

R57 is selected from the group consisting of H, halogen, hydoxy, and substituted or 

unsubstituted alkyl, aryl, heteroaryl, heterocyclyl, and carbonyl. 

Sterol compounds as described throughout this application include compounds of 
formula (XII) as shown below: 




wherein, 

R 58 is selected from the group consisting of substituted or unsubstituted aryl, arylalkyl, 
heteroaryl, heteroarylalkyl, heterocyclyl, and heterocyclylalkyl. 
Preferably R 58 is a pyranone substituent 
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Quinazilinone compounds as described throughout this application include compounds of 
formula (XHT) as shown below: 




5 wherein, 

R59 is selected from the group consisting of H, halogen, hydroxy, and substituted or 
unsubstituted alkyl, aminoalkyl, alklyaminoalkyl, alkoxy, dialkylaminoalkyl, 
hydroxyalkyl, alkenyl, alkynyl, carbocyclyl, caibocyclylalkyl, heterocyclyl, and 
heterocyclylalkyl; 

10 R#o is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 

arylalkyl, heteroarylalkyl, and heterocyclylalkyl; and, 

R*i, R^2» R63, and R*t may be the same or different and are independently selected from 
the group consisting of H, nitro, halogen, amino, hydroxy, cyano, caiboxcyclic acid, and 
substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, alkylcarbonyl, 
15 alkylcarbonylamino, alkylaminocaibonyl, aminocaibonyl, aiylalkoxy, heteroarylalkoxy, 

alkylamino, arylalkylamino, arylamino, heteroarylamino, heteroarylaminoalkyl, 
heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and caibocyclyl groups. 

Pyrrole compounds as described throughout this application include compounds of 
10 formula (XIV) as shown below: 

Res y FW 

Jet 

I 

Res 
XIV 

wherein, 

Res is selected from the group consisting of H, hydroxy, and substituted or unsubstituted 
5 alkyl, aryl, heteroaryl, heteroarylalkyl, arylalkyl, heteroarylaminoalkyl, arylaminoalkyl, 

heteroaryloxyalkyl, and aryloxyalkyl groups; 

&66> R67, Rds, and may be the same or different and are independently selected from 

the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic acid, and 
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substituted or unsubstituted alkyi, aryl, heteroaryl, aDcoxy, alkylcarbonyl, 
alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, arylalkoxy, heteroarylalkoxy, 
alkylamino, arylalkylamino, arylamino, heteroarylamino, heteroarylaminoalkyl, 
heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and caibocyclyl groups. 

Further preferred pyrrole compounds include those shown in Formula (XV): 




wherein: 

Ki is nitrogen, oxygen, or optionally substituted carbon; 

Wis absent or is selected from the group consisting of -O-, -S-, -S(O)-, -S0 2 -, -NH-, -NH- 
CO^-NR'CO-, -NHS0 2 -, -NR'S0 2 -, -CO-, -C0 2 ~, -CH 2 ~, -CF 2 ~, CHF, -CONH-, -CONR'-, 
and -NR'-, where R' is alkyi, substituted alkyi, cycloalkyl, aryl, heteroaryl, heterocyclo; 
Ar is optionally substituted aryl, heteroaryl, or a protecting group; : 
R 70 and R70' are independently selected from the group consisting of hydrogen and methyl; 
R711 R72, R-73, and R74 are independently selected from the group consisting of hydrogen, 
hydroxyl, and optionally substituted loweralkyl, cycloloweralkyl, cyclicaminoalkyl, alkyl- 
aminoalkyl, loweralkoxy, amino, alkylamino, alkylcarbonyl, arylcarbonyl, aralkylcarbonyl, 
heteroarylcarbonyl, heteroaralkylcarbonyl, aryl and heteroaryl; 
R75 and R 7 8 are independently selected from the group consisting of hydrogen, halo, and 
optionally substituted loweralkyl, cycloalkyl, alkoxy, amino, aminoalkoxy, carbonyloxy, 
aminocarbonyloxy, alkylcarbonylamino, arylcarbonylamino, aralkylcarbonylamino, hetero- 
arylcarbonylamino, heteroaralkylcarbonylamino, cycloimido, heterocycloimido, amidino, 
cycloamidino, heterocycloamidino, guanidinyl, aryl, heteroaryl, heterocycloalkyl, 
heterocyclocarbonyloxy, heteroarylcarbonyloxy, and arylsulfonamido; 
R 7 6 is selected from the group consisting of hydrogen, aryl, heteroaryl, substituted heteroaryl, 
heterocyclyl, and substituted heterocyclyl; 
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R77 is selected from the group consisting of hydrogen, hydroxy, halo, carboxyi, nitro, amino, 
amido, amidino, imido, cyano, sulfonyl, methanesulonyl, and substituted or unsubstituted alkyl, 
alkoxy, alkylcarbonyl, arylcaibonyl, aralkylcarbonyl, heteroarylcarbonyl, heteroaralkylcarbonyl, 
alkylcaibonyloxy, arylcaibonyloxy, aralkylcaibonyloxy, heteroarylcarbonyloxy, 

5 heteroaralkylcarbonyloxy, alkylaminocarbonyloxy, arylaminocarbonyloxy, fonnyl, 
loweralkylcaibonyl, loweralkoxycaibonyl, aminocarbonyl, aminoaryl, alkylsulfonyl, 
sulfonamido, aminoalkoxy, alkylamino, heteroarylamino, alkylcarbonylamino, 
alkylaminocarbonylamino, ajylaminocaibonylamino, aralkylcaibonylamino, 
heteroarylcaibonylamino, arylcarbonylamino, heteroarylcarbonylamino cycloamido, 

10 cyclothioamido, cycloamidino, heterocycloamidino, cycloalkyl, cycloimido, heterocycloimido, 
guanidinyl, aryl, heteroaryl, heterocyclo, heterocycloalkyl, arylsulfonyl and arylsulfonamido; 




R 82 O 



XVI 



wherein, 



R79, R«o, R*i, and R«2 may be the same or different and are independently selected from 
the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic acid, and 
substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, alkylcarbonyl, 



alkylcarbonylamino, sulfonyl, aminosulfonyl, alkylaminocarbonyl, aminocarbonyl, 



arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 



carbocyclyl groups; and, 



and are taken together to form a substituted or unsubstituted 5-6 membered ring 
containing all carbon atoms or 1-2 heteroatoms selected from the group consisting of O, 
S, andN. 
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Quinoxaline compounds referred to throughout this application include tricyclic, partially 
unconjugated compounds optionally substituted with nitrogen heteroatoms as shown in the 
preferred quinoxaline embodiment (XVII) below: 



wherein, 

Ji is either C or N, 

1\ is selected from the group consisting of H, substituted aryl, unsubstituted aryl, 
substituted heteroaryl, and unsubstituted heteroaryl; 
J2 is either C or N, 

1% is selected from the group consisting of H, substituted aryl, unsubstituted aryl, 

substituted heteroaryl, and unsubstituted heteroaryl; 

J 3 is selected from the group consisting of -CO-, -NH-, and -N=; 

if J4 is -O- then J4 is absent; or, 

if J 4 is =C- then J4* is selected from the group consisting of H and substituted or 
unsubstituted alkyl, alkoxy, aryl, heteroaryl, heteroarylalkyl, arylalkyl, aminoalkyl, 
alkylamino, and alkylthio groups; and, 

Ras, Rs& R*7, Rss, and R&g may be the same or different and are independently selected 
from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, carboxcyclic acid, 
and substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, alkylcaibonyl, 
alkylcarbonylamino, sulf onyl, aminosulfonyl, alkylaminocarbonyl, aminocarbonyl, 
arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups. 

Triazine compounds refer to substituted 6-membered heterocyclic groups with 3 nitrogen 
atoms distributed throughout the ring. The preferred embodiments of the instant invention 
include those shown in structures (XVIII), (XIX) and (XX) shown below: 




xvn 
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O ?93 




xvm 



wherein, 



R90 is selected from the group consisting of substituted or unsubstituted alkyl, alkenyl, 
akynyl, aryl, heteroaryl, heteroarylalkyl, heteroarylalkenyl, arylalkyl, and arylalkenyl; 
R91 and R93 are independently selected from the group consisting of H, and unsubstituted 
alkyl; 

R91 is aryl; preferably phenyl, 



R^is selected from the group consisting of H, amino, alkyl, aminoalkyl, and halogen; 



arylalkylamino, heteroaryl, heteroarylamino, and heteroalkylamino; 

R96 and R97 are independently selected from the group consisting of H, halogen, and 

alkyl, preferably methyl; or, 

R96 may form a double bond with the nitrogen atom directly below it as indicated by the 
dashed line in the above structure; and, 




R95 



XIX 



wherein, 



R95 is selected from the group consisting of substituted or unsubstituted aryl, arylamino, 




XX 



wherein, 



-251- 



WO 2004/092360 



PCT/US2004/011710 



R98 is selected from the group consisting of H, substituted alkyl, and unsubstituted alkyl; 
preferably methyl, 

R99 is selected from the group consisting of H, substituted alkyl, and unsubstituted alkyl; 
preferably ethyl, 

5 R100 is selected from the group consisting of substituted or unsubstituted aryl, heteroaryl, 

alkoxyaryl, arylalkyl, and heteroarylalkyL 



Benzazole compounds as described throughout this application include compounds of 
10 formula (XXI) as shown below: 



R 7 W^v^ 



XXI 

wherein, 

A is selected from the group consisting of -O-, -S-, -NH-, and -NRg-; 

15 W is selected from the group consisting of -CEfe-, -O-, -S-, -NH-, and -NRg-; 

R 7 is selected from the group consisting of carbocyclyl, unfused carbocyclylcarbocyclyl, 
substituted aryl, unsubstituted aryl, substituted heteroaryl, unsubstituted heteroaryl, 
substituted fused arylheteroaryl, unsubstituted fused arylheteroaryl, substituted unfused 
arylaryl and unsubstituted unfused arylaryl; 

10 is selected from the group consisting of substituted or unsubstituted aryl, and 

heteroaryl; and, 

Rg is independently substituted or unsubstituted alkyl. 

Pyrazalopyrimidine compounds as described throughout this application include 
15 compounds of formula (XXII) as shown below: 
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wherein, 

Rioi is selected from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, 
carboxcyclic acid, and substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, 
alkylcarbonyl, alkylcarbonylamino, sulfonyl, aminosulfonyl, alkylaminocaibonyl, 
5 aminocarbonyl, arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, 

heteroarylamino, heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, 
heterocyclylalkyl, and carbocyclyl groups; 

R102 is selected from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, 
carboxcyclic acid, and substituted or unsubstituted alkyl, aryl, heteroaryl, alkoxy, 
10 alkylcarbonyl, alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, arylalkoxy, 

heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups; 

R103 is selected from the group consisting of H, nitro, halogen, amino, hydroxy, cyano, * 
15 carboxcyclic acid, trifluoromethyl, and substituted or unsubstituted alkyl, aryl, heteroaryl, 

alkoxy, alkylcarbonyl, alkylcarbonylamino, alkylaminocarbonyl, aminocarbonyl, 
arylalkoxy, heteroarylalkoxy, alkylamino, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, and 
carbocyclyl groups; 

10 R104 is selected from the group consisting of H and substituted or unsubstituted aryl, 

heteroaryl, arylalkoxy, heteroarylalkoxy, arylalkylamino, arylamino, heteroarylamino, 
heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, 
carbocyclylalkyl and carbocyclyl groups; 

R105 is selected from the group consisting of H and substituted or unsubstituted aryl, 
15 heteroaryl, arylalkoxy, heteroarylalkoxy, arylalkylamino, arylamino, heteroarylamino, 

heteroarylaminoalkyl, heterocyclyl, heterocyclylalkoxy, heterocyclylalkyl, 
carbocyclylalkyl and carbocyclyl groups; 
wherein at least one of R104 and R105 is not EL 



SMIP compounds identified by in-vitro (cellular or non-cellular assays) or in-vivo 
methods are thoroughly described in Methods 1 and 2 below. 
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Pharmaceutical compositions containing the compounds of the invention may be in any 
form suitable for the intended method of administration, including, for example, a solution, a 
suspension, or an emulsion. liquid carriers are typically used in preparing solutions, 
suspensions, and emulsions. Liquid carriers contemplated for use in the practice of the present 
5 invention include, for example, water, saline, pharmaceutical^ acceptable organic solvent(s), 
pharmaceutical^ acceptable oils or fats, and the like, as well as mixtures of two or more thereof. 
The liquid carrier may contain other suitable pharmaceutical^ acceptable additives such as 
solubilizers, emulsifiers, nutrients, buffers, preservatives, suspending agents, thickening agents, 
viscosity regulators, stabilizers, and the like. Suitable organic solvents include, for example, 

10 monohydric alcohols, such as ethanol, and polyhydric alcohols, such as glycols. Suitable oils 
include, for example, soybean oil, coconut oil, olive oil, safflower oil, cottonseed oil, and the 
like. For parenteral administration, the carrier can also be an oily ester such as ethyl oleate, 
isopropyl myristate, and the like. Compositions of the present invention may also be in the form • 
of microparticles, microcapsules, liposomal encapsulates, and the like, as well as combinations 

15 of any two or more thereof. 

Other additives include immunostimulatory agents known in the art. Immunostimulatory 
oligonucleotides and polynucleotides are described in PCT WO 98/55495 and PCT WO 
98/16247. U.S. Patent Application No. 2002/0164341 describes adjuvants including an 
unmethylated CpG dinucleotide (CpG ODN) and a non-nucleic acid adjuvant U.S. Patent 

10 Application No. 2002/0197269 describes compositions comprising an antigen, an antigenic 

CpG-ODN and a polycationic polymer. Other immunostimulatory additives described in the art . 
may be used, for example, as described in U.S. Patent No. 5,026,546; U.S. Patent No. 4,806352;. 
and U.S. Patent No. 5,026,543. 

A controlled release delivery system may be used, such as a diffusion controlled matrix * 

55 system or an erodible system, as described for example in: Lee, 'Diffusion-Controlled Matrix 
Systems", pp. 155-198 and Ron and Langer, "Erodible Systems", pp. 199-224, in 'Treatise on 
Controlled Drug Delivery", A. Kydonieus Ed., Marcel Dekker, Inc., New Yoric 1992. The 
matrix may be, for example, a biodegradable material that can degrade spontaneously in situ and 
in vivo for, example, by hydrolysis or enzymatic cleavage, e.g., by proteases. The delivery 

;0 system may be, for example, a naturally occurring or synthetic polymer or copolymer, for 
example in the form of a hydrogel. Exemplary polymers with cleavable linkages include 
polyesters, polyorthoesters, polyanhydrides, polysaccharides, poly(phosphoesters), polyamides, 
polyurethanes, poly(imidocarbonates) and poly(phosphazenes). 
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Hie compounds of the invention may be administered enterally, orally, parenterally, 
sublingually, by inhalation spray, rectally, or topically in dosage unit formulations containing 
conventional nontoxic pharmaceutical^ acceptable carriers, adjuvants, and vehicles as desired 
For example, suitable modes of administration include oral, subcutaneous, transdermal, 
5 transmucosal, iontophoretic, intravenous, intramuscular, intraperitoneal, intranasal, subdermal, 
rectal, and the like. Topical administration may also involve the use of transdermal 
administration such as transdermal patches or ionophoresis devices. The term parenteral 
includes subcutaneous injections, intravenous, intramuscular, intrasternal injection, or infusion 
techniques. 

10 Injectable preparations, for example, sterile injectable aqueous or oleaginous suspensions 

may be formulated according to the known art using suitable dispersing or wetting agents and 
suspending agents. The sterile injectable preparation may also be a sterile injectable solution or 
suspension in a Nontoxic parenterally acceptable diluent or solvent, for example, as a solution in 
1,3-propanediol. Among the acceptable vehicles and solvents that may be employed are water, 

15 Ringer's solution, and isotonic sodium chloride solution. In addition, sterile, fixed oils are 

conventionally employed as a solvent or suspending medium. For this purpose any bland fixed 
oil may be employed including synthetic mono- or diglycerides. In addition, fatty acids such as 
oleic acid find use in the preparation of injectables. 

Suppositories for rectal administration of the drug can be prepared by mixing the drug 

20 with a suitable nonirritating excipient such as cocoa butter and polyethylene glycols that are solid 
at ordinary temperatures but liquid at the rectal temperature and will therefore melt in the rectum 
and release the drug. 

Solid dosage forms for oral administration may include capsules, tablets, pills, powders, 
and granules. In such solid dosage forms, the active compound may be admixed with at least 

25 one inert diluent such as sucrose lactose or starch. Such dosage forms may also comprise, as is 
normal practice, additional substances other than inert diluents, e.g., lubricating agents such as 
magnesium stearate. In the case of capsules, tablets, and pills, the dosage forms may also 
comprise buffering agents. Tablets and pills can additionally be prepared with enteric coatings. 
Liquid dosage forms for oral administration may include pharmaceutically acceptable 

30 emulsions, solutions, suspensions, syrups, and elixirs containing inert diluents commonly used in 
the art, such as water. Such compositions may also comprise adjuvants, such as wetting agents, 
emulsifying and suspending agents, cyclodextrins, and sweetening, flavoring, and perfuming 
agents. 
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As to the mode of administration, it should be emphasized that it is the combination of 
therapeutic agents that gives rise to its synergistic therapeutic effect no matter whether the first 
and the second agent are administered together or separately. Therefore, the two agents may be 
given together in a single dose or in separate ones with respect to space and time. 
5 Effective amounts of the compounds of the invention generally include any amount 

sufficient to detectably treat viral infections. 

Successful treatment of a subject in accordance with the invention may result in the 
inducement of a reduction or alleviation of symptoms in a subject afflicted with a medical or 
biological disorder to, for example, halt the further progression of the disorder, or the prevention 
10 of the disorder. 

The amount of active ingredient that may be combined with the carrier materials to 
produce a single dosage form will vary depending upon the host treated and the particular mode 
of administration. It will be understood, however, that the specific dose level for any particular 
patient will depend upon a variety of factors including the activity of the specific compound ^ 
15 employed, the age, body weight, general health, sex, diet, time of administration, route of 
administration, rate of excretion, drug combination, and the severity of the particular disease 
undergoing therapy. The therapeutically effective amount for a given situation can be readily 
determined by routine experimentation and is within the skill and judgment of the ordinary 
clinician. 

20 The compounds of the present invention can also be administered in the form of 

liposomes. As is known in the art; liposomes are generally derived from phospholipids or other - 
lipid substances. Liposomes are formed by mono- or multilamellar hydrated liquid crystals that 
are dispersed in an aqueous medium. Any non-toxic, physiologically acceptable and 
metabolizable lipid capable of forming liposomes can be used The present compositions in 

25 liposome form can contain, in addition to a compound of the present invention, stabilizers, 
preservatives, excipients, and the like. The preferred lipids are the phospholipids and 
phosphatidyl cholines flecithins), na tural and synthetic. Methods to form liposomes are 
known in the art See, for example, Prescott, Ed., Methods in Cell Biology, Volume XIV, 
Academic Press, New York, N.W., p. 33 et seq (1976). 

50 While the SMIP compounds of the invention can be administered as the sole active 

pharmaceutical agent, they can also be used in combination with one or more other agents used 
in the treatment of SARSs. Other representative agents useful in combination with the 
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compounds of the invention for the treatment of viral infections include, for example, interferon, 
ribavirin, gancyclovir and the like. 

When additional active agents are used in combination with the compounds of the present 
invention, the additional active agents may generally be employed in therapeutic amounts as 

5 indicated in the Physicians' Desk Reference (PDR) 53 rf Edition (1999), that is incorporated 
herein by reference, or such therapeutically useful amounts as would be known to one of 
ordinary skill in the art 

The compounds of the invention and the other therapeutically active agents can be 
administered at the recommended maximum clinical dosage or at lower doses. Dosage levels of 

10 the active compounds in the compositions of the invention may be varied so as to obtain a 

desired therapeutic response depending on the route of administration, severity of the disease and 
the response of the patient The combination can be administered as separate compositions or as 
a single dosage form containing both agents. When administered as a combination, the 
therapeutic agents can be formulated as separate compositions that are given at the same time or 

15 different times, or the therapeutic agents can be given as a single composition. 

Compounds of the present invention can be readily synthesized using the methods 
described herein, or other methods, that are well known in the art 

The compounds can be used in the form of salts derived from inorganic or organic acids. 
These salts include but are not limited to the following: acetate, adipate, alginate, citrate, 

20 aspartate, benzoate, benzenesulfonate, bisulfate, butyrate, camphorate, camphorsulfonate, 
digluconate, cyclopentanepropionate, dodecylsulfate, ethanesulfonate, glucoheptanoate, 
glycerophosphate, hemisulfate, heptanoate, hexanoate, fumarate, hydrochloride, hydrobromide, 
hydroiodide, 2-hydroxyethanesulfonate, lactate, maleate, methanesulfonate, nicotinate, 
2-napthalenesulfonate, oxalate, pamoate, pectinate, persulfate, 3-phenylproionate, picrate, 

25 pivalate, propionate, succinate, tartrate, thiocyanate, p-toluenesulfonate and undecanoate. Also, 
the basic nitrogen-containing groups can be quaternized with such agents as loweralkyl halides, 
such as methyl, ethyl, propyl, and butyl chloride, bromides, and iodides; dialkyl sulfates like 
dimethyl, diethyl, dibutyl, and diamyl sulfates, long chain halides such as decyl, lauryl, myristyl 
and stearyl chlorides, bromides and iodides, aralkyl halides like benzyl and phenethyl bromides, 

30 and others. Water or oil-soluble or dispersible products are thereby obtained. 

Examples of acids that may be employed to form pharmaceutical^ acceptable acid 
addition salts include such inorganic acids as hydrochloric acid, sulphuric acid and phosphoric 
acid and such organic acids as oxalic acid, maleic acid, succinic acid and citric acid. Basic 
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addition salts can be prepared in situ during the final isolation and purification of the compounds 
of formula CD, or separately by reacting carboxylic acid moieties with a suitable base such as the 
hydroxide, carbonate or bicaibonate of a pharmaceutical acceptable metal cation or with 
ammonia, or an organic primary, secondary or tertiary amine. Pharmaceutical acceptable salts 
5 include, but are not limited to, cations based on the alkali and alkaline earth metals, such as 
sodium, lithium, potassium, calcium, magnesium, aluminum salts and the like, as well as 
nontoxic ammonium, quaternary ammonium, and amine cations, including, but not limited to 
ammonium, tetramethylammonium, tetraethylammoniuin, methyl amine, dimethylamine, 
trimethylamine, triethylamine, ethylamine, and the like. Other representative organic amines 

10 useful for the formation of base addition salts include diethylamine, ethylenediamine, ' 
ethanolamine, diethanolamine, piperazine and the like. 

Various compounds and methods of their synthesis are disclosed in international patent 
application Publication Nos. WO02/1 8327 (benzamide and pyridylamide based compounds); 
WO0222598, and WO02/18383 (ABIQ based compounds); and WO 02/81443 (pthalamide base 

15 compounds), that have been found within context of this invention to be useful for immune 
potentiation. The entire disclosure of these U.S. and international publications is incorporated 
herein by this reference. Other compounds or intermediates of interest in the present invention 
were purchased from commercially available sources using the following method: the chemical 
structure of interest was drawn into the ACD-SC database (from MDL Information Systems). A 

JO search of the following companies/institutions, among others, retrieved the identified 

compound's supplier and purchasing information: ASDI, ASINEX, BIONET, OEBEMBRIDGE, . 
CHEMDIV, CHEMEX, CHEMSTAR, COMGENEX, CSC, INTERBIOSCREEN, LABOTEST, 
MAYBRIDGE, MICROSOURCE/GENESIS, OLIVIA, ORION, PEAKDALE, RYAN 
SCIENTIFIC, SPECS, TTMTEC, U OF FLORIDA, and ZELINSKY. 

i5 

BENZAZOLE COMPOUNDS 
Scheme 1 

Compounds of the invention containing a benzimidazole core may be prepared using a 
number of methods familiar to one of skill in the art. In one method, suitably functionalized 
>0 diamines may be coupled with various thioisocyanates to form the intermediate thioureas. 

Cyclization to form the benzimidazole moiety may be effected under known conditions such as 
with treatment carbodiimides or alkyl halides. Alternatively the diamines may be reacted 
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sequentially with carbonyl diimidazole and phosphoryl chloride followed by coupling with the 
appropriate amine. 

AiNCS (T^V^ 

1. (lmid) 2 CO /WN AMU i 

I 2- POC«3 (Q^>-CI I 

R 

Compounds containing the oxazole structure may similarly be prepared according to the 
5 methods above or according to other known general procedures. Haviv et. al. (J. Med Chem. 
1988, 31, 1719) describes a procedure for assembling oxazole cores wherein a hydroxy aniline is 
treated with ethyl potassium xanthate. The resulting sulfuryl benzoxazole may then be 
chlorinated and coupled with an amine. 

ff'V^ 2 ^^ N v 1.SOCI 2 ^^-N 

R R R 

10 Compounds containing a benzothiazole core may also be prepared according to known 

methods. An ortho-halothioisocyanate may be reacted with an amine to form a thiourea. 
Reiduction with NaH then allows formation of the thiazole ring. 




Benzothiazoles may generally be substituted in accordance with the present invention, 
15 such as through the following synthetic pathway: 
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Synthesis of 4-[(2-{ [4^hloro-3<trifluoromethyl)phenyl]amino}- 

( } 

!H-ben7.i midnvrA ^vDoxvl-N-methvIpvridine-2-carhox ami Hr 
The compound 4-[(2-{ [4^Woix>-3~(trifluoromethyl)phenyl]amino } -lH-benzimidazol-6- 

5 yl)oxy]-N-methylpyridine-2^arboxamide (159322) was synthesized as follows: 




Step 1. Synthesis of 4-[(4-amino-3-nitrophenyl^ 
A mixture containing 4-amino-3-nitrophenol (leq) and potassium bis(trimethylsilyl)amide (2eq) 
was stirred in dimethylform amide for 2 hours at room temperature. To this mixture was added 

10 (4^hloro(2-pyridyl))-Ar-methylcaxboxarnide (leq) and potassium carbonate (1.2eq) and stirred at 
90°C for 3 days. Hie reaction mixture was then concentrated and partitioned between ethyl 
acetate and water. The organic layer was separated and washed with brine, dried, filtered, and 
concentrated in vacuum to give brown solid. Purification on silica gel (2% triethyl amine / 50% 
ethyl acetate in hexane) gave 4-[(4-amino-3-nitrophen^ 

15 as an orange solid The product gave satisfactory NMR. HPLC, 3.39min; MS: MH* = 289. 

Step 2. Synthesis of 4-[(3,4-diaminophenyl)oxy]-i^^ The 
mixture containing [4^3-amino^nitrophenoxy)(2-pyridyl)]-iV- in methanol with catalytic 
amount of 10%Pd/C was hydrogenated until disappearance of the yellow color to yield the 
product amine. HPLC, 2.5mins; MS: MH+ = 259. 

JO 

Step 3. Synthesis of 4~[(2-{[4^Moro-3-(tri^ 
ben7imida7X)l-6"yl)oxy]"N-methylpyridine-2^arboxamide: 

The mixture containing 4-[(3,4-diaminophenyl)oxy]-iV-methyl^^ (leq) and 

4^hloro-3-(trifluoromethyl)benzeneisothiocyanate (leq) in tetrahydrofuran was stirred at room 
55 temperature for 16 hours to give the corresponding thiourea. To the resulting mixture was added 
l^thyl-3-(3-^iimethylaminopropyl)carbodiimide hydrochloride (2eq) and the mixture was stirred 
for another 10 hours. Hie mixture was concentrated and partitioned between ethyl acetate and 
water. Hie organic layer was washed with brine and dried. Purification on HPLC gave 4-[(2- 
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{[4K;Moro-3-(triflu(OT^ 
carboxamide. MS: MH* =462 

Synthesis of 4-({24(4-bromophenyl)amino]-l-methy 

IH-benzjm iHflznM- ylloxvVNHD^^^ 
5 The compound 4-({ 2-[(4-bromophenyl)arnino]-l-methyl-lH-benzimida^ 

methylpyridine-2-carboxamide (161651) was synthesized as follows: 




Step 1. Synthesis of 4-{[3-amino^(methylamino)phenyl]oxy}-iV-methylpyridine-2- 
carboxamide: A solution of 4-[(4~amincH3-nito^ 

10 (leq) in methylene chloride was treated with trifluoroacetic anhydride (leq) and stirred for 10 
minutes at 0 °C. The mixture was quenched with satd. NaHC(>3 solution. The organic layer was 
separated and washed with water, brine, dried and evaporated MS: MH + =385.2 

To a solution of the trifluroacetamide (leq) in a mixture of toluene, acetonitrile and 
sodium hydroride solution (50%) was added benzyltrimethylammonium chloride (leq) and 

15 dimethyl sulfate (1.2eq). The biphasic mixture was stirred overnight at room temperature and 
evaporated. The mixture was taken up in ethyl acetate, washed with water, brine, dried and 
evaporated. The crude product was purified by column chromatography eluting with 1 : 1 
hexanes and ethylacetate followed by 2% triethylamine in 1:1 hexanes and ethyl acetate followed 
by 2% triethylamine in 1: 1 hexanes and ethyl acetate to afford iV-methyl-4-{ [4-(methylamino)-3- 

20 nitrophenyl]oxy}pyridine-2-carboxamide as a reddish orange solid. MS: MH + = 303.1. 

The solution of nitromethylaniline in methanol was treated with 5% palladium on carbon 
and stirred under hydrogen atmosphere for 15 min. (until the disappearance of yellow coloration) 
at room temperature. The mixture was filtered and the filtrate was concentrated to provide 0.36 
g of the diamine 4-{[3-amincnHmethylamino)pheny 

25 MS: MH + = 273.3. 

Step 2. Synthesis of 4-({2-[(4-bromophenyl)amino]-l-methyl-lH-benzimidazol-5-- 
yl}oxy)-N-methylpyridine-2-carboxaniide: A solution of the diamine 4-{[3-amino-4- 
(methylamino)phenyl]oxy }-i^methylpyridine-2-carboxamide (leq) in methanol was treated with 
4-bromophenylisothiocyanate (leq) and stirred at 60 °C - 65°C for 2 hours. The reaction 

30 mixture was cooled down to room temperature and methyl iodide (leq) was added and stirred 
overnight at 60°C. The reaction was cooled to room temperature, evaporated, taken up in ethyl 
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acetate, and washed with water and brine, dried, and evaporated under reduced pressure. 
Column chromatography using a gradient solvent system of hexanes and ethyl acetate and either 
1:1 methylene chloride and acetone or 5% methanol in methylene chloride yielded the product as 
a half white powder. MS: MH*=4523 

AMINOBENZINffl)AZOLYU3UINOIJNONES 

Compounds of structure I may be synthesized from simple starting molecules as shown in 
Schemes 1-4 and exemplified in the Examples: As shown in Scheme 1, compounds of structure 
I may generally be prepared using aromatic compounds substituted with amines and carboxylic 
acid groups. 

Scheme 2. 

r A^^CC^H 



aQ0 2 H O O II I 





H 2 N 
heat 



As shown in Scheme 2, a substituted aromatic compound such as a substituted or 
unsubstituted 2-aminobenzoic acid may be reacted with an acyl halide such as methyl 2- 
(chlorocarbonyl)acetate to produce an amide that will react with a substituted or unsubstituted 
1 ,2-diaminobenzene. The resulting product is a 4-hydroxy-substituted compound of structure I. 
One skilled in the art will recognize that the procedure set forth in Scheme 1 may be modified to 
produce various compounds. 

A method for preparing 4-amino substituted compounds of structure I is shown in 
Scheme 3. As shown in Scheme 3, aromatic compounds substituted with amine and nitrile 
groups may be used to synthesize 4-amino substituted compounds of structure I. A compound 
such as ethyl 2-cyanoacetate may be reacted with ethanol to produce ethyl 3-ethoxy-3- 
iminopropanoate hydrochloride. Subsequent reaction with a substituted or unsubstituted 1,2- 
phenylenediamine provides substituted or unsubstituted ethyl 2-benzimidazol-2-ylacetate. 
Reaction of a substituted or unsubstituted ethyl 2~benzimidazol-2-ylacetate with an aromatic 
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compound having an amine and nitrile group such as substituted or unsubstituted 2- 
aminobenzonitrile with a base such as lithium bis(trimethylsilyl)amide or a Lewis acid such as 
tin tetrachloride provides the substituted or unsubstituted 4-amino substituted compound of 
structure L 



5 




Scheme ,4 illustrates a general synthetic route that allows for the synthesis of 4- 
dialkylamino and 4-alkylamino compounds of structure L An inspection of Scheme 3 shows that 
4-hydroxy substituted compounds of structure I may be converted into the 4-chloro derivative by 
reaction with phosphorous oxychloride or thionyl chloride. The 4-chloro derivative may then be 
10 reacted with an alkylamine or dialkylamine to produce the corresponding 4-alkylamino or 4- 
dialkylamino derivative. Deprotection affords the final 4-alkylamino or 4-dialkylamino 
compounds of structure L Other groups that may be reacted with the 4-chloro derivative in this 
manner include, but are not limited to, ROH, RSH, and CuCN. 
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Scheme 4. 




(P = protecting 
group) 



1 m r rrCT 

deprotect V JL/ — 




^0 " 
H 

As shown in Scheme 5, the synthesis of compounds of structure I having a H, alkyl 
group, aryl group, or heterocyclyl group in the 4-position may be accomplished using a 
substituted or unsubstituted 2-benzimidazol-2-ylacetate prepared as shown in Schemes 3 and 4. ■ 



Scheme 5. r 

y^COH 



/=VR' 



EtO^ r, ~ piperidine 

HOAc 

R" = H,alkyl 
aryl, heterocyclyl 



N x) 
H 



THIOSEMCARBAZONES 

General procedure for the preparation of thiosemicarbazones 
Scheme 6 



H H 



A solution of aldehyde (1.0 equiv.) and thiosemicarbazide (L05 equiv.) in acetic acid was 
stirred overnight Excess of acetic acid was removed to give a residue, that was washed with 
ethanol, or purified by preparative-HPLC to give the thiosemicarbazone. 
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Scheme 7 



A solution of aldehyde (1.0 equiv.), thiosemicaibazide (1.05 equiv.) and acetic acid (0.1 
equiv.) in methanol was stiired overnight Methanol was removed to give a residue, that was 
worked up as in Scheme 6. 



To a solution of { [(lE)-l-aza-2-(4-fluon>-3-niliophenyl)vinyl]amino } -aminomethane-1- 
thione in ethanol was added an arylamine (2.1 equiv.). The solution was stirred at room 
temperature until the starting fluoride disappeared. The solution was purified to the product 



A mixture of 4-(diethylamino>-2-hydroxybenzaldehyde (1 equiv.), benzylic bromide (1.2 
equiv.) and powder potassium carbonate in ethanol was stirred at room temperature for 2 days. 
Ethanol was removed, and the residue was dissolved in ethyl acetate and water. The organic 
15 layer was washed with aqueous NaHCOs and brine, dried over Na 2 S0 4 ., and concentrated. Hie 
residue was purified on silica gel eluting with ethyl acetate/hexane to give 4-(diethylamino>2- 
benzoxylic-benzaldehyde. 



5 Scheme 8 




10 



Scheme 9 



The aldehydes were converted to thiosemicarbazones according to Scheme 7. 




Scheme 10 
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A solution of 3,4-difluorobenz^iecarbonitiile (1 equiv.), amine (1.5 equiv.) and DIEA (2 
equiv.) in NMP was heated in a Smith Microwave (Personal Chemistry) for 30 minutes. The 
reaction mixture was purified on silica gel to give 4-substituted 3-fluorobenzenecarbonitrile. 

To a solution of nitrile in toluene at -78 °C was added DEBAL^H (1 M in toluene, 1.5 
5 equiv.). Hie reaction mixture was wanned to rt, and stirred for 16 h, and quenched with 
methanol/ethyl acetate/brine (1:1:4). After being stirred at rt for 30 min, the solution was 
extracted with ethyl acetate (3x). The combined organic layers were washed with aqueous 
NaHCCb, brine and concentrated. The aldehyde was purified on silica gel or directly converted 
to thiosemicarbazones (Scheme 7). 
10 Scheme 11 

A solution of 2,4,5-trifluorobenzenecarbonitrile (1 equiv.) and 4-arylpiperazine (1.2 
equiv.) and DIEA (1.2 equiv.) in THF was heated at 80 °C for 2 hours. The mixture was purified 
on silica gel to give 4-substituted 2,5-difluorobenzenecarbonitrile. 
Scheme 12 

15 To an alcohol (1.0 equiv) was added potassium f-butoxide in THF (1 M, 1.1 equiv). 

After 5 minutes, the solution was added to a solution of 4-N-substituted-2,5- 
difluorobenzenecarbonitrile (1 equiv.) in THF. The reaction mixture was stirred at rt overnight 
and quenched with aqueous ammonium chloride. The aqueous layer was extracted with.ethyl 
acetate (3x). The combined organic layers were washed with brine, and concentrated to give a 

20 residue, that was purified to give 4-N-substituted-2^-substimted-5-fluorobenzenecaiboni1rile. 

. . 4-N-substimted-2-0-substimted-5-fluorobenzenecarbonitrile was reduced with DIBAL-H 
to give a 4-N-substituted-2-0-substituted-5-fluorobenzaldehyde according to procedure in 
Scheme 10. 

The aldehyde was converted to the corresponding thiosemicarbazone using Scheme 7. 
25 Scheme 13 

A solution of 4-N-substituted-2,5^fluorobenzenecarbomtrile (1 equiv.), amine (1.5 
equiv.) and DIEA (2 equiv.) in NMP was heated in a Smith Microwave (Personal Chemistry) for 
30 minutes. The reaction mixture was purified on silica gel to give 4-N-substituted-2-N- 
substituted-5-fluorobenzenecarbonitrile. 
i0 4-N-substimted-2-N-substimted-5-fluorobenzenecarbonitrile was reduced with DIBAL-H 

according to procedure described in Scheme 10 to give 4-N-substituted-2-N-substituted-5- 
fluorobenzaldehyde. 

Preparation of ammo{3-[5-(3^Morophenyl)(2-fi^ 
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To a solution of 5-(3^hlorophenyl)furan-2-carbaldehyde (L0 equiv.) in THF at 0 °C was 
5 added MeMgBr in ether (3.0 equiv.) and stirred for 45 min. The reaction was quenched with 
water, diluted with ether and filtered through Celite. Hie organic layer was separated and 
washed with brine, dried overMgS04, and concentrated to give the l-[5-(3~chlorophenyl)-2- 
furyl]ethan-l-ol. 

To a solution of secondary alcohol(1.0 equiv.) in CH 2 Cl2was added MnC>2 (10 equiv.). 

10 The reaction was stirred overnight, filtered through Celite, and concentrated to give l-[5-(3- 
chlorophenyl)~2-furyl]ethan-l-one. 

To a mixture of ketone (1.0 equiv.), paraformaldehyde (2.0 equiv.), and dimethylamine 
hydrochloride (2.0equiv) and molecular sieves in ethanol was added concentrated hydrochloric i 
acid (cat). The reaction was refluxed overnight under nitrogen and the concentrated. A few 

15 drops of HC1 was added, and the mixture was worked up with DCM and water. The organic 
layer was discarded. The aqueous layer was adjusted to basic and extracted with DCM (3x). 
The organic layer was washed with brine, dried over MgSC>4, and concentrated to yield 3- 
(dimethylaiiiino)-l-[5K3^Morophenyl)(2-furyl)]propan-l-one. 

Thiosemicarbazdde (1.0 equiv.) was dissolved in MeOH upon heating under nitrogen. 

20 Aqueous sodium hydroxide (6 M, 9.0 equiv.) was added to the reaction. A methanol solution of 
3-(dimethylamino)-l~[5-(3^hlorophenyl)(2-furyl)]propan-l-one (1.0 equiv) was then added 
dropwise to the reaction mixture. The solvent was removed and the residue was dissolved in 
DCM and washed with water, brine, dried over MgS04, and concentrated. The final compound 
was purified by preparative-HPLC to give amino{3-[5-(3-chlorophenyl)(2-furyl)](2- 

25 pyrazolinyl) }methane-l-thione; LC/MS m/z 306.2 (MH+); Rt =3.06 minutes . 
Scheme 14 




Cl 
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To a solution of 4-pyridylmethylamine (1.0 equiv.) and triethylanrine (2.0 equiv.) in 
CHC1 3 was added CS 2 (1.0 equiv.)) and stirred overnight Hie reaction was cooled to 0 °C and 
ethyl chloroformate (1.0 equiv.) was added dropwise. Hie reaction was stirred for 15 min at 0 °C 
and then stirred at room temperature for 2 hrs followed by addition of (tert- 
5 butyl)oxycaibohydrazdde (1.2 equiv.). After stirring for an addition hour the mixture was 
washed with aqueous citric acid (5%), saturated NaHCCh, brine, dried over MgS0 4 , and 
concentrated The desired Boc protected thiosemicarbazide was purified using column 
chromatography. 

To a solution of Boc protected thiosemicarbazide (1.0 equiv.) dissolved in DCM was 

10 added HC1 in dioxane (2M, 8.3 equiv.) and stirred for 15 min. MeOH is then added to dissolve 

the precipitate, followed by addition of the furfural, and small amount of acetic acid (0.5 mL). 

The mixture is stirred overnight and the solvents are removed to give a residue purified by 
1, .... » 

preparative-HPLC to give the thiosemicarbazone. 

Synthesis of 4-[4-(4-methylpiperazin-l-yl)phCTOxymethyl]benzaldehyde 



15 



HN^^N— ° H Boc— N^~\l— ^ ^ — ONa 



To a solution of 4-piperazin-l-yl phenol (1 equivalent) in CHCI3, cooled to 0 °C,was 
added di-r-butyl dicarbonate (1 equivalent) in CHCI3 drop-wise. The solution was stirred at 0 °C 
for 1 hour before removing from the cold bath and stirring at ambient temperatures for 18 hours. 
20 The organic solution was washed aqueous NaHCOa and brine dried over MgSO* and 
concentrated the crude material was used without purification. 

A solution of the resulting 4-(l-B(X>piperazdn-4-yl)phenol (1 equivalent) in dry CH3CN 
was slowly added drop-wise to a slurry of NaH (1 equivalent) in dry CH3CN at room 
temperature under The slurry was stirred at room temperature for 2 hours before the solids 
25 were filtered and washed with Et20. 

,0 



P^-N / ^N--^^—ONa - Boc-N~ 



Sodium 4-(l-BOC-piperazin-4-yl)phenoxide (1 equivalent) and methyl 4- 
bromomethylbenzoate (1 equivalent) were combined in dry acetone and heated to reflux at 60 °C 
for 18 hours. The slurry was filtered and the filtrate was then concentrated to provide the crude 
30 methyl 4-[4-(l-BOC-piperazin-4-yl)phenoxymethyl]benzoate, that was used without 
purification. 
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To a slurry of IiAlEU (4 equivalents) in dry THF, cooled to 0 °C under N 2 , was slowly 
added drop-wise a solution of methyl 4-[4-(l-BOC-piperazin-4-yl)phenoxyiiiethyl]benzoate (1 
5 equivalent) in dry THF. Once the addition was complete, the slurry was heated to reflux at 80 
°C for 1 hour. Hie slurry was subsequently cooled to 0 °C and treated with water, 10% aq. 
NaOH and with water again. The resulting solids were filtered, and the filtrate was diluted with 
chloroform, washed with brine, dried over MgSC>4 and concentrated, providing the crude 4-[4-(4- 
methylpiperazin-l-yl)phenoxymethyl]benzyl alcohol that was used without purification 4 . 



10 




To a solution of DMSO (2.6 equivalents) in dry DCM , cooled to -78 °C under N 2 was 
added oxalyl chloride (1.1 equivalents) in DCM drop-wise. The solution was stirred at -78 °C 
for 5 minutes before a solution of 4-[4-(4-methylpiperazin-l-yl)phenoxymethyl]benzyl alcohol 

15 (1 equivalent) in DCM was added drop-wise, and allowed to stir at -78 °C for another 30 

minutes. Triethylamine (2.5 equivalents) was slowly dripped in before allowing the solution to 
reach ambient temperatures. The solution was washed with aqueous NaHCCb and brine, dried 
over MgS04 and concentrated to provide the crude 4-[4-(4-methylpiperazin-l- 
yl)phenoxymethyl]benzaldehyde that was converted to thiosemicarbazones according to Scheme 

20 7. , 
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PYRROLES 

Scheme IS 
Synthesis of Pyrrole 



5 




10 Preparation of te/f -butyl (2E)-3-(2,4-dichlorophenyl)prop-2-enoate (2). 

Neat DIC (1.4 eq) was added to a well stirred solution of cinnamate (1 eq), t-butyl 
alcohol (4 eq), DMAP (1 .4 eq) and CH2CI2 under argon at rt (Note - Hie cinnamate must be 
completely in solution that may require gentle warming. Allow the solution to cool to room 
temperature before adding the DIC. To avoid an exotherm on larger scales, it may be prudent to 
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dilute the DIC with CH2CI2 before the addition and have an ice bath ready.) After stirring for 8 
hours, the reaction develops a white precipitate. Hie reaction may be monitored by TLC eluting 
with 25% EtOAc/Hexane (Rf of product was 0.9). The entire reaction was loaded into a 
separatory funnel (washing with CH2CI2). The organic mixture was washed with citrate, sat aq. 

5 NaHC0 3 , water, and brine. Hie organic layer was dried (Na 2 S04), filtered, and concentrated to 
dryness to give the crude product as an oil. The crude oil was mixed with hexane and stirred for 
30min. The precipitate that forms was filtered over celite and the filtrate was evaporated. Hie 
hexane mixture was loaded onto a filter plug of silica and eluted with EtOAc/hexane (97:2 v/v). 
The first eluted UV active fractions are collected and evaporated to give >99% pure 2 (75-80% 

10 yields). 




3 



Preparation of fe/f -butyl 4-(2,4-dichlorophenyl)pyrrole-3-carboxy late (3). 

15 Dry ether was added to NaH (1.5 eq as the oil dispersion) under argon. After decanting 

off the ether via syringe, the NaH was suspended again with fresh ether under argon. A solution * 
of TOSMIC (1.1 eq) and 2 (1 eq) dissolved in a mixture of ether and DMSO was added dropwise 
to the stirred suspension of NaH at 0 °C over 20-30 min. The addition was mildly exothermic 
and evolved gas. After the addition, the reaction was allowed to warm to ambient rt The 

Z0 progress of the reaction was followed by TLC (25% EtOAc/Hexane, the UV active product was 
at Rf = 0.4) and LCMS until done (-2-3 h). Upon completion, the reaction was carefully 
quenched with sat. aq. NH4CI (added slowly to avoid strong gas evolution and exotherm) and 
diluted with ether. Hie layers were separated and the organic phase was washed with sat aq. 
NaHC0 3 , water, and brine. The crude dark solid can be purified by recrystallization. Best 

15 results were achieved either through recrystallization directly from a mixture of hot 

EtOAc/hexane (1:3 v/v) or by dissolving the crude product in minimal hot EtOAc followed by 
addition of hexane (-2 volumes of hexane based on the volume of EtOAc). The hot solutions 
were allowed to cool to room temperature and age over night The crystals were first filtered and 
then washed with hexane giving 99% pure product in 60-70 % yield. 

50 
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H 



NaH, DMF, 
RT, 70% yield 



4 



3 



5 



Preparation of tert -butyl 4-(2,4-dichIorophenyI)-l-[3-(l r 3^oxobenzo[c]azolin-2- 
yl)propyl]pyrrole-3-carboxylate (4). 

Solid NaH (1.5 eq as the oil dispersion) was added in small portions to a solution of 



pyrrole 3 (1 eq) and 3-bromopropyl phthalimide (1.2 eq) dissolved in DMF stirred at room 
temperature and flushed with argon. NOTE - Some gas evolves, but the temperature does not 
seem to rise above 40-50 °C. The reaction was stirred for 1.5 h at room temperature under argon. 
The reaction was followed by TLC (CHaCl^acetonitrile (95:5 v/v), the UV active product was at 

10 Rf = 0.5) and LCMS. Upon completion, the reaction was quenched with sat aq. NEUCl (add 
slowly to avoid strong gas evolution and exothenn). Sat aq. NaHC0 3 was then added to avoid 
an emulsion, and the basic organic mixture was extracted with ether. The combined ether layers 
were washed with sat aq. NaHC03, water, brine, dried Na 2 S04, filtered, and concentrated to 
dryness to give the crude product The crude product was purified by eluting through silica with 

15 EtOAc/Hexane (1:4 v/v). The purified product contained some residual 3-bromopropyl 

phthalimide, that did not interfere with subsequent synthetic steps. The material was taken on 
and used without further purification. Assume a quantitative yield. 



20 Preparation of terf-butyl l-(3-amiiiopropyl>4-(2,4^cM^ 



The Pthalimido Pyrrole 4 (1 eq) was dissolved in ethanol and hydrazine (3 eq) at room 
temperature under nitrogen. Upon heating to reflux, the reaction generated a white precipitate. 
Stir at reflux until complete (-2 h) by TLC (OT 2 Cl2/acetonitrile (95:5 v/v), the UV active 
15 product was at Rf= 0.2) and LCMS. Upon reaching completion, the reaction was allowed to 
cool to room temperature and the precipitate was vacuum-filtered off using a medium to fine 
cintered-glass filter. The filtrate was concentrated under reduced pressure to a gummy solid. 




4 



5 



(5). 
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Hie crude material was taken up in ethanol/EtOAc (1:1 v/v), stirred and the precipitate was 
filtered off in the same fashion as before. Hie filtrate was concentrated under reduced pressure 
and than dried in vacuo for 10-15 min. This process of adding ethanol/EtOAc, filtering and 
concentrating was done one more time or as needed to remove the majority of the white 
precipitate and residual hydrazine. Hie product was then dried in vacuo overnight Hie material 
was used without further purification. Once dried, the reaction yielded the product as a glass 
(-87% yield over 2 steps). 




5 



Preparation of terf-butyl 1^3-[(6-aniino-5-iutro^ 
dichlorophenyl)pyrrole-3-carboxylate (7). 

To the premixed dry reagents, pyrrole 5 (1 eq) and powdered 6-chloro-3-nitro-2- 
pyridylamine (6) (1.1 eq), was added the DMA followed by Hiinig's base (2 eq) sequentially with 
stirring at rt. The reaction was then heated to 80 °C overnight The reaction was followed by 
TLC (EtOAc/hexane (1:1 v/v), the UV active yellow product was at R f = 0.25), HPLC and 
LCMS. Upon completion as judged by HPLC, the reaction was allowed to cool to 70 ?C. 
Ethylene diamine (anhydrous) was then added to the reaction to destroy any remaining unreacted 
chloropyridine 6. After 15 min stirring at 70 °C, the reaction was cooled and quenched with the 
addition of sat. aq. NaHCC>3. The aqueous mixture was extracted with EtOAc, and the combined 
organic layers were washed with sat aq. NaHC0 3 , water, brine, dried, filtered, and concentrated 
to dryness to give the crude product as a brown-yellow solid The crude product was purified by 
flash chromatography eluted with EtOAc/hexane (4:6 v/v). The purified SnAr adduct 7 was 
isolated in 58% yield as a yellow solid. 



TFA (10% v/v) 

CH2CJ2, 
1% water, 
RT, -99% yield 
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Preparation of 1- { 3-[(6~amino-5-iutro(2-pyridyl))amiDo]ptopyl } -4-{2,4- 
dichlorophenyl)pynole-3-caiboxylic acid (8). 

In a vial, TFA (catalytic amount) was added to a stirred mixture of tert-butyl ester pyrrole 

7 (1 eq), water (-1%), and CH2CI2 at it The vial stirred at room temperature until done (-12 h. 

5 The reaction was then concentrated under reduced pressure at room temperature and dried in 

vacuo. Hie crude residue was dissolved again in CH2CI2 and concentrated under reduced 

pressure at rt Hie material was used in the final coupling step without further purification as the 



10 

Preparation of iV-((lS)-2-hydro:^^ 

pyridyl))amino]propyl^ (9,). 

(2SX+>2-Aminopropan-l-ol (1.5 eq) was added to a stirred mixture of acid (8) (1 eq), 
HBTU (1.5 eq), Htlnig's base (2 eq) and DMF (premixed sequentially in this order in a vial) at 
15 room temperature under argon. The reaction was stirred for 3-4 h until complete as shown by 
LCMS and HPLC. The reaction mixture was subsequently diluted with EtOAc, washed with 
NaHC0 3t and concentrated to afford a powder in a 70% yield. 

Nomenclature for the Example compounds was provided using ACD Name version 5.07 
software (November 14, 2001) available from Advanced Chemistry Development, Inc. Some of 
20 the compounds and starting materials were named using standard IUPAC nomenclature. • 

The compounds of Table 34 were synthesized following the synthetic methodology 
described above in the Examples and Schemes, and screened following methods 1 and 2 below. 
Hie precursors are readily recognizable by one skilled in the art and are commercially available 
from Aldrich (Milwaukee, WI) or Acros Organics (Pittsburgh, PA), among others. 



Candidate small molecule immuno-potentiators can be identified in vitro. Compounds 
are screened in vitro for their ability to activate immune cells. One marker of such activation is 
30 the induction of cytokine production, for example TNF-a production. Apoptosis inducing small 



TFA salt 




CHIR 154703 



Screening methods for SMDP/SMIS compounds 



Method 1 
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molecules may be identified having this activity. These small molecule immuno-potentiators 
have potential utility as adjuvants and immuno-therapeutics. 

In an assay procedure (High Throughput Screening (HTS)) for small molecule immune 
potentiators (SMIPs), human peripheral blood mononuclear cells (PBMQ, 500,000 per mL in 

5 RPMI 1640 medium with 10% FCS, were distributed in 96 well plates (100,000 per well) 

already containing 5jiM of compound in DMSO. The PBMCs were incubated for 18 h at 37°C in 
5% C0 2 . Their ability to produce cytokines in response to the small molecule compounds is 
determined using a modified sandwich BUS A. 

Briefly supernatants from the PBMC cultures were assayed for secreted TNF using a 

10 primary plate bound antibody for capture followed by a secondary biotinylated anti-TNF 
antibody forming a sandwich. The biotinylated second antibody was then detected using 
streptavidin-Europium and the amount of bound europium was determined by time resolved 
fluorescence. SMIP compounds were confirmed by their TNF inducing activity that was. 
measured in the assay as increased Europim counts over cells incubated in RPMI medium alone. 

15 "Hits" were selected based on their TNF-inducing activity relative to an optimal dose of 

lipopolysaccaride LPS (1 jig/ml), a strong TNF inducer. The rob\istness of the assay and low 
backgrounds allowed for the routine selection of hits with -10% of LPS activity that was 
normally between 5-10X background (cells alone). Selected hits are then subjected to 
confirmation for their ability to induce cytokines from multiple donors at decreasing 

20 concentrations. Those compounds with consistent activity at or below 5\jM are considered 
confirmed for the purposes of this assay. The assay is readily modified for screening for 
compounds effective at higher or lower concentrations. 

Method 2 

25 Each of the compounds in the above Table 34 elicited TNF-a production in human 

peripheral blood mononuclear cells. Many of the compounds showed activity at less than 20 [xM 
with respect to production of TNF-a. Many of these compounds showed activity at less than 5 
yM with respect to production of TNF-a. Many of these compounds showed activity in the 
production of TNF-a at less than 1.5 pM. 

30 For this reason, each of the R groups of any of the compounds listed in Table 34are 

preferred. Additionally, because of the excellent activity of each of the compounds, each of 
these compounds is individually preferred and is preferred as a member of a group that includes 
any or all of the other compounds and each compound is preferred in methods of modulating 
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iromunopotentiation and in methods of treating biological conditions associated therewith, for 
example to be used as a vaccine adjuvant Each of the compounds is also preferred for use in 
preparation of medicaments for vaccines, immunopotentiation, reducing tumor growth and in 
treating biological conditions mediated therefrom. 
5 In additon to the procedure described above, methods of measuring other cytokines (e.g. 

ILl-beta, 11^12, IL-6, IFN-gamma, IL-10 etc.) are well known in the art and can be used to find 
active SMD? compounds of the present invention. 

Compounds may be useful that cause production of TNF-a at higher concentrations, such 
as lOOpM, 200 jiM or 300jiM in the assays described herein. For example Loxoribine causes 
10 useful production of TNF-a at 300pM (see Pope et al Immunostimulatory Compound 7-Allyl-8- 
Oxoguanosine (Loxoribine) Induces a Distinct Subset of Murine Cytokines Cellular 
Immunology 162: 333-339 (1995)). 

Hie subject invention also includes isotopically-labeled antiviral compounds, that are 

15 structurally identical to those disclosed above, but for the fact that one or more atoms are 

replaced by an atom having an atomic mass or mass number different from the atomic mass or 
mass number usually found in nature. Examples of isotopes that can be incorporated into 
antiviral compounds of the invention include isotopes of hydrogen, carbon, nitrogen, oxygen, 
phosphorous, sulfur, fluorine and chlorine, such as 2 H, ^ 13 C, 14 C, 15 N, ls O, 17 0, 31 P, M P, ^S, 

20 lg F and ^Cl, respectively. Antiviral compounds of the present invention, derivatives thereof, and 
pharmaceutical acceptable salts of said compounds and of said derivatives that contain the 
aforementioned isotopes and/or other isotopes of other atoms are within the scope of this 
invention. Certain isotopically-labeled antiviral compounds of the present invention, for example 
those into which radioactive isotopes such as *H and 14 C are incorporated, are useful in drug 

25 and/or substrate tissue distribution assays. Tritiated, z.e., S H 9 and carbon-14, Le. y 14 C, isotopes are 
particularly preferred for their ease of preparation and detectability. Further, substitution with 
heavier isotopes such as deuterium, Le. 9 2 H, may afford certain therapeutic advantages resulting 
from greater metabolic stability, for example increased in vivo half-life or reduced dosage 
requirements and, hence, may be preferred in some circumstances. Isotopically labeled antiviral 

50 compounds of this invention and derivatives thereof can generally be prepared by carrying out 
known or referenced procedures and by substituting a readily available isotopically labeled 
reagent for a non-isotopically labeled reagent 

In accordance with the present invention, methods are provided for the administration of an 

effective amount of a SMIP compound to act as an adjuvant. Also provided are immunogenic 

15 compositions comprising a SMIP compound, an antigen, and optionally other adjuvants. 
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As adjuvants, the SMIP compounds are combined with antigens and delivery systems to 
form a final immunogenic composition or vaccine product 

As immunotherapeutics, the SMBP compounds are used alone or in combination with 
other therapies for treatment of SARS. 
5 Those of ordinary skill in the art will recognize that physiologically active antiviral 

compounds, SMIPs or SMESs that have accessible hydroxy groups are frequently administered in 
the form of phannaceutically acceptable esters. The antiviral compounds of this invention can 
be effectively administered as an ester, formed on the hydroxy groups, just as one skilled in 
pharmaceutical chemistry would expect It is possible, as has long been known in 

10 pharmaceutical chemistry, to adjust the rate or duration of action of the antiviral compound by 
appropriate choices of ester groups. 

Other compounds that can be used in combination with the therapeutic agents described 
herein include, pentoxifylline (PTX), methylprednisolone, trimetrexate (Neutiexin), Zadaxin 
(thymosin alpha 1), optionally substituted 5-aminomethinimino-3-methyl-4-isoxazolecaiboxylic 

15 acid phenylamides, cyclosporine A (CsA), 6-oxo- 1 ,4,5-thiadiazin [2,3-&]quinazoline, 3-amino- 
2(L^tWoxo^3fl)-quinazolinone, gangciclovir, glycyrrhizin, tetracyclines, aminoglycosides, 
quinolones, bicyclam (l,4-Bis(l,4,8,ll-tetraazacyclotetradec-l-ylmethyl)beiizene 
octahydrochloride dihydrate), rapamycin, wortmannin, enalapril, roquinimex/linomide, 
inactivin, DNCB, AG7088, 9-aminocamptothecin (CPT-11) , loxorobine, bropirimine, Ononase 

20 ® (ranpirnase), statins, such as: lovastatin— Mevacor®, pravastatin— Pravachol®, simvastatin— 
Zocor®, fluvastatin-Lescol® , atorvastatin — Lipitor® and rosuvastatin-Crestor®. 

As used herein, the term "effective amount" means an amount of antiviral compound of the 
compositions, kits and methods of the. present invention that is capable of treating the symptoms 
of the described conditions. The specific dose of a compound administered according to this 

15 invention will, of course, be determined by the particular circumstances surrounding the case 
including, for example, the compound administered, the route of administration, the state of 
being of the patient, and the severity of the condition being treated. 

Hie dose of an antiviral compound of this invention to be administered to a subject is 
rather widely variable and subject to the judgment of the attending physician. It should be noted 

JO that it may be necessary to adjust the dose of a compound when it is administered in the form of 
a salt, such as a laureate, the salt forming moiety of which has an appreciable molecular weight 

The following dosage amounts and other dosage amounts set forth elsewhere in this 
description are for an average human subject having a weight of about 65 kg to about 70 kg. The 
skilled practitioner will readily be able to determine the dosage amount required for a subject 

\5 whose weight falls outside the 65 kg to 70 kg range, based upon the medical history of the 
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subject and the presence of diseases, e.g., diabetes, in the subject Calculation of the dosage 
amount for other forms of the free base form such as salts or hydrates is easily accomplished by 
performing a simple ratio relative to the molecular weights of the species involved 

In general, the pharmaceutical compositions will include at least one antiviral compound 
5 in combination with a pharmaceutical^ acceptable vehicle, such as saline, buffered saline, 5% 
dextrose in water, borate-buffered saline containing trace metals or the like. Formulations may 
further include one or more excipients, preservatives, solubilizers, buffering agents, lubricants, 
fillers, stabilizers, etc. Methods of formulation are well known in the art and are disclosed, for 
example, in "Remington's Pharmaceutical Sciences," Mack Pub, Co., New Jersey (1991) or 

10 "Remington: The Science and Practice of Pharmacy," 20 th ed., Lippincott Williams & Wilkins, 
Baltimore, Maryland (2000), incorporated herein by reference. 

Pharmaceutical compositions for use within the present invention can be in the form of 
sterile, non-pyrogenic liquid solutions or suspensions, coated capsules, suppositories, lyophilized 
powders, transdermal patches or other forms known in the art. 

15 Many of the active ingredient antiviral compounds are known to be absorbed from the 

alimentary tract, and so it is usually preferred to administer a compound orally for reasons of 
convenience. However, the compounds may equally effectively be administered intravenously, 
subcutaneously, percutaneously, or as suppositories for absorption by the rectum or vagina, if 
desired in a given instance. All of the usual types of compositions may be used, including 

20 tablets, chewable tablets, capsules, solutions, parenteral solutions, troches, suppositories and 
suspensions. Compositions are formulated to contain a daily dose, or a convenient fraction of 
daily dose, in a dosage unit, that may be a single tablet or capsule or convenient volume of a 
liquid. 

Capsules are prepared by mixing the compound or compounds with a suitable diluent and 
25 filling the proper amount of the mixture in capsules. The usual diluents include inert powdered 
substances such as starch of many different kinds, powdered cellulose, especially crystalline and 
microcrystalline cellulose, sugars such as fructose, mannitol and sucrose, grain flours and similar 
edible powders. 

Tablets are prepared by direct compression, by wet granulation, or by dry granulation. 

30 Their formulations usually incorporate diluents, binders, lubricants and disintegrators as well as 
the compound or compounds. Typical diluents include, for example, various types of starch, 
lactose, mannitol, kaolin, calcium phosphate or sulfate, inorganic salts such as sodium chloride - 
and powdered sugar. Powdered cellulose derivatives are also useful. Typical tablet binders are 
substances such as starch, gelatin and sugars such as lactose, fructose, glucose and the like. 

35 Natural and synthetic gums are also convenient, including acacia, alginates, methylcellulose, 
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polyvinylpyrrolidine and the like. Polyethylene glycol, ethylcellulose and waxes can also serve 
as binders. 

A lubricant is generally necessary in a tablet formulation to prevent the tablet and punches 
from sticking in the die. Hie lubricant is chosen from such slippery solids as talc, magnesium 
5 and calcium stearate, stearic acid and hydrogenated vegetable oils. 

Tablet disintegrators are substances that swell when wetted to break up die tablet and 
release the compound or compounds. They include starches, clays, celluloses, algins and gums, 
more particularly, corn and potato starches, methylcellulose, agar, bentonite, wood cellulose, 
powdered natural sponge, cation-exchange resins, alginic acid, guar gum, citrus pulp and 
10 carboxymethylcellulose, for example, may be used as well as sodium lauryl sulfate. 

Tablets are often coated with sugar as a flavor and sealant, or with film-forming protecting 
agents to modify the dissolution properties of the tablet The compounds may also be formulated 
as chewable tablets, by using relatively large amounts of pleasant-tasting substances such as 
mannitol in the formulation, as is now well-established in the art 
15 When it is desired to administer a compound as a suppository, the typical bases may be 

used. Cocoa butter is a traditional suppository base, that may be modified by addition of waxes 
to raise its melting point slightly. Water-miscible suppository bases comprising, particularly, 
polyethylene glycols of various molecular weights are in wide Use. 

The effect of the compounds may be delayed or prolonged by proper formulation. For 
20 example, a slowly soluble pellet of the compound may be prepared and incorporated in a tablet 
or capsule. The technique may be improved by making pellets of several different dissolution 
rates and filling capsules with a mixture of the pellets. Tablets or capsules may be coated with a 
film that resists dissolution for a predictable period of time. Even the parenteral preparations 
may be made long-acting by dissolving or suspending the compound or compounds in oily or 
>5 emulsified vehicles that allow dispersion slowly in the serum. 

The combinations of this invention may be administered in a controlled release formulation 
such as a slow release or a fast release formulation. Such controlled release formulations of the 
combination of this invention may be prepared using methods well known to those skilled in the 
art. The method of administration will be determined by the attendant physician or other person 
\0 skilled in the art after an evaluation of the subject's condition and requirements. 

The term "prodrug" means compounds that are transformed in vivo to yield an antiviral 
compound of the present invention. The transformation may occur by various mechanisms, such 
as through hydrolysis in blood. A good discussion of the use of prodrugs is provided by T. 
Bfiguchi and W. Stella, '"Pro-drugs as Novel Delivery Systems," Vol. 14 of the A.C.S. 
»5 Symposium Series, and in Bioreversible Carriers in Drug Design, ed Edward B. Roche, 
American Pharmaceutical Association and Pergamon Press, 1987. The term, "prodrug" also 
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encompasses mutual prodrugs in which one or more antiviral compounds are combined in a 
single molecule that may then undergo transformation to yield the individual antiviral 
compounds of the present invention. 

For example, if an antiviral compound of the present invention contains a caiboxylic acid 
5 functional group, a prodrug can comprise an ester formed by the replacement of the hydrogen 
atom of the acid group with a group such as (Ci-Cg)alkyl, (C2-Ci2)alkanoyloxymethyl, 1- 
(alkanoyloxy)ethyl having from 4 to 9 carbon atoms, l-methyl-l-(alkanoyloxy)-ethyl having 
from 5 to 10 carbon atoms, alkoxycarbonyloxymethyl having from 3 to 6 carbon atoms, 1- 
(alkoxycarbonyloxy)ethyl having from 4 to 7 carbon atoms, 1-methyl-l- 
10 (alkoxycarbonyloxy)ethyl having from 5 to 8 carbon atoms, N-(alkoxycarbonyl)aminomethyl 
having from 3 to 9 carbon atoms, l-(N-(alkoxycarbonyl)amino)ethyl having from 4 to 10 carbon 

* atoms, 3-phthalidyl, 4-crotonolactonyl, gamma-butyrolacton-4-yl, di-N^-(Ci<;2)alkylaniino(C2- 
C3>alkyl (such as P-dimethylaminoethyl), caibamoyl-(Ci-C 2 )alkyl, NJN-di(Cr 
C2)alkylcarbamoyl-(Ci-C2)alkyl and piperidino-, pyrrolidino- or morpholino(C2-C3)alkyl. 

15 Similarly,- if an antiviral compound of the present invention comprises an alcohol 

functional group, a prodrug can be formed by the replacement of the hydrogen atom of the 
alcohol group with a group such as (Ci-C6)alkanoyloxymethyl, l-((Ci-Q)alkanoyloxy)ethyl, 1- 
methyl-l-((Ci-C*)alkanoyloxy)ethyl, (Ci-Q)alkoxycarbonyloxymethyl, N-(Ci- 
C^)alkoxycarbonylaminomethyl, succinoyl, (Ci-Q)alkanoyl, a-annno(Ci-C4)alkanoyl, arylacyl 

10 . and a -aminoacyl, or a-aminoacyl-a-aminoacyl, where each a-aminoacyl group is independently 

• selected from the naturally occurring Lramino acids, P(0)(OH>2, -P(0)(0(Ci-C6)alkyl)2 or 
glycosyl (the radical resulting from the removal of a hydroxyl group of the hemiacetal form of a 
carbohydrate). 

If an antiviral compound of the present invention comprises an amine functional group, a 
15 prodrug can be formed by the replacement of a hydrogen atom in the amine group with a group 
such as R x -carbonyl, R x O-carbonyl, NR^^-carbonyl where R x and R* are each independently 
((Ci-Cio)alkyl, (C3-C7)cycloalkyl, benzyl, or R x -carbonyl is a natural a -aminoacyl or natural a- 
aminoacyl-natural a-aminoacyl, -C(OH)C(0)OY X wherein (Y x is H, (Ci-C6)alkyl or benzyl), - 
C(OY™) Y X1 wherein Y x0 is (C1-C4) alkyl and Y xl is ((Ci-C^alkyl, carboxytQ-QOalkyl, 
JO amino(Ci-C 4 )alkyl or mono-N- or di-N^-(Ci-Q)alkylaminoalkyl, -CCY* 2 ) Y* 3 wherein Y^ is 
H or methyl and Y 30 is mono-N- or di-N,N-(Ci<^)alkylamino, morpholino, piperidin-l-yl or 
pyrrolidin-l-yl. 

The compositions for use in accordance with the present invention may be formulated in a 
conventional manner using one or more physiologically acceptable carriers or excipients. 
*5 Antiviral, SMIP, SMB, or other immunomodulating compounds are prepared or obtained as 
described herein and in the US Patents and published international patent applications listed in 
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Table 1, Table 2, Table 34 and Table 35. Hie antiviral compounds can be formulated in 
pharmaceutical^ acceptable compositions suitable for delivery to the lungs. Particular 
formulations include dry powders, liquid solutions or suspensions suitable for nebulization and 
propellant formulations suitable for use in metered dose inhalers. The preparation of such 

5 formulations is well know to those skilled in the art, and is described in US Patent Nos. 

5,814,607 and 5,654,007 and in the US Patents and published international patent applications 
listed in Table 3 the disclosures of which are incorporated herein by reference. 

Dry powder formulations will comprise an antiviral compound in a dry, optionally 
lyophilized form with a particle size within a preferred range for deposition within the lung. 

10 Typically the particle size for deposition in the lung will range between 1 and 5 /xm. When 

systemic delivery of the antiviral compound via absorption from the lung into the bloodstream is 
desired the antiviral compound formulation particle size is generally between 0.1 and 2 pm in 
size; The preferred size range of particles can be produced using methods such as jet-milling, 
spray drying and solvent precipitation, for example. Dry powder devices typically require a 

15 powder mass in the range from about 1 mg to 100 mg to produce an aerosolized dose. Thus, the 
antiviral compound will typically be combined with a pharmaceutically acceptable dry bulking 
powder. Preferred dry bulking powders include sucrose, lactose, trehalose, human serum 
albumin (HS A), phospholipids and glycine as well as those disclosed in the documents listed in 
Table 3. Dry powders can be administered to the subject in conventional dry powder inhalers. 

20 For liquid formulations the antiviral compound can be dissolved in any recognized 

physiologically acceptable carrier for use in delivery of aerosolized formulations. Such carriers 
include buffered and unbuffered aqueous solutions for water soluble compounds, and • v 
physiological solutions including saline solution (preferably between 0.2 and 2 N NaCl). For 

• . antiviral compounds with limited solubility, other liquid vehicles such as ethanol, propylene 

25 glycol and ethanol-propylene combinations may be used. The antiviral compounds may also be 
administered as solids in suspension. i 

For administration by inhalation, the compositions for use according to the present 
invention are conveniently delivered in the form of an aerosol spray administered via pressurized 
packs or a nebulizer, with the use of a propellant, e.g., air, dichlorordifluoromethane, 

30 dichloroterafluoroethane or other suitable gas. Preferably, for incorporation into the aerosol 
propellant, the antiviral compound formulations of the present invention will be processed into 
respirable particles as described above for the dry powder formulations. The particles are then 
suspended in the propellant, optionally being coated with a surfactant to enhance their 
disbursement. In the use of a pressurized aerosol the dosage unit may be determined by 

55 providing a valve to deliver a metered amount 
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Commercially available jet nebulizers are available and may be used to deliver aerosolized 
antiviral compound to a subject Such jet nebulizers include, but are not limited to, those 
supplied by AeroTech 1 1 (CIS-US, Bedford, Mass.). In addition, for delivery of aerosolized 
antiviral compound to the lungs of a subject an oxygen source can be attached to the nebulizer 
5 providing a flow rale of, for example, 10 L/min. In general, inhalation is performed over a 5-40 
minute time interval through a mouthpiece during spontaneous respiration. Hie present 
invention provides for novel compositions comprising a suitable carrier and aerosolized antiviral 
compound in doses sufficient to reduce or ameliorate viral load and SARS symptoms in subjects 
having SARS. Such doses can be lower than corresponding systemic doses that may be used to 
10 those generally used to reduce or ameliorate viral load and SARS symptoms in subjects having 
SARS. 

The antiviral, SMIP, SMIS, and immunomodulating compositions of the present invention 
may be administered with a steroidal anti-inflammatory drag for the treatment of SARS and 
SARS symptoms. Examples of steroidal anti-inflammatory drugs of the invention include 

15 hydrocortisone, prednisolone, dexamethasone, triamcinolone acetonide, fluocinolone acetonide, 
fludrocortisone acetate, betamethasone, etc. 

The antiviral compound composition of the invention is nebulized predominantly into 
particle sipes allowing a delivery of the drug into the terminal and respiratory bronchioles. For 
efficacious delivery of antiviral compound to the lung endobronchial space of airways in an 

10 aerosol, the formation of aerosol particles having mass medium average diameter predominantly 
between 1 to.5 fim is necessary. The formulation must additionally provide conditions that 
would not adversely affect the functionality of the airways. Consequently, the formulation must 
contain enough of the drug formulated under the conditions that allow its efficacious delivery 
while avoiding undesirable reaction. 

15 For liquid solutions and suspensions, the choice of the nebulizer is made from among 

commercially available nebulizers. The jet nebulizers known as Sidestream O, obtained from 
Medicaid and Pari LCS, LC Plus, and eFlow obtained from Pari Respiratory Equipment, 
Richmond, Virginia, are examples of typical nebulizers suitable for the practice of the invention. 
Ultrasonic nebulizers that produce appropriate particle sizes of about 1 to 5 /im such as 

»0 Aerosonic by DeVilbiss and UltraAire by Omron are also suitable. 

Advantageously, the present invention also provides for a kit for use by a consumer for the 
treatment and/or prevention of SARS. Such a kit comprises: (a) a pharmaceutical composition 
comprising a therapeutically effective amount of at least one compound from among those 
described herein or listed in Table 34 and Table 35 or described in the US Patents and published 

5 international patent applications listed in Table 1, Table 2, and Table 35 and a pharmaceutical^ 
acceptable carrier, vehicle or diluent; (b) a container for holding the pharmaceutical composition; 

-282- 



WO 2004/092360 



PCT/US2004/011710 



and, optionally, (c) instructions describing a method of using the pharmaceutical compositions 
for the treatment and or the prevention of SARS. Hie kit may optionally contain a plurality of 
antiviral compounds for the treatment of SARS wherein the and viral compounds are selected 
from 3C-like protease inhibitors and papain-like protease inhibitors. In a further embodiment, 

5 the kit contains an antiviral compound which is an RNA-dependent RNA polymerase inhibitor. 
When the kit comprises more than one antiviral compound, the antiviral compounds contained in 
the kit may be optionally combined in the same pharmaceutical composition. 

A "kit" as used in the instant application includes a container for containing the separate 
compositions such as a divided bottle or a divided foil packet The container can be in any 

10 conventional shape or form as known in the art that is made of a pharmaceutical^ acceptable 
material, for example a paper or cardboard box, a glass or plastic bottle or jar, a re-sealable bag 
(for example, to.hold a "refill" of tablets for placement into a different container), or a blister 
pack with individual doses for pressing out of the pack according toa therapeutic schedule. Hie 
container employed can depend on the exact dosage form involved, for example a conventional 

15 cardboard box would not generally be used to hold a liquid suspension. It is feasible that more 
than one container can be used together in a single package to market a single dosage form. For 
example, tablets may be contained in a bottle that is in turn contained within a box. 

An example of such a kit is a so-called blister pack- Blister packs are well known in the 
packaging industry, and are being widely used for the packaging of pharmaceutical unit dosage 

20 forms (tablets, capsules, and the like). Blister packs generally consist of a sheet of relatively stiff 
material covered with a foil of a preferably transparent plastic material. During the packaging 
process, recesses are formed in the plastic foil. The recesses have the size and shape of 
individual tablets or capsules to be packed or may have the size and shape to accommodate 
multiple tablets and/or capsules to be packed. Next, the tablets or capsules are placed in the 

25 recesses accordingly and the sheet of relatively stiff material is sealed against the plastic foil at 
the face of the foil that is opposite from the direction in which the recesses were formed. As a 
result, the tablets or capsules are individually sealed or collectively sealed, as desired, in the 
recesses between the plastic foil and the sheet Preferably the strength of the sheet is such that 
the tablets or capsules can be removed from the blister pack by manually applying pressure on 

30 the recesses whereby an opening is formed in the sheet at the place of the recess. The tablet or 
capsule can then be removed via said opening. 

It maybe desirable to provide a written memory aid, where the written memory aid is of the 
type containing information and/or instructions for the physician, pharmacist or subject, e.g., in 
the form of numbers next to the tablets or capsules whereby the numbers correspond with the 

35 days of the regimen which the tablets or capsules so specified should be ingested or a card that 
contains the same type of information. Another example of such a memory aid is a calendar 
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printed on the card e.g. 9 as follows "First Week, Monday, Tuesday," ...etc "Second Week, 

Monday, Tuesday, ..." etc. Other variations of memory aids will be readily apparent A "daily 
dose" can be a single tablet or capsule or several tablets or capsules to be taken on a given day. 
Also a daily dose of one or more components) of the kit can consist of one tablet or capsule 
5 while a daily dose of another one or more components) of the kit can consist of several tablets 
or capsules. 

Another specific embodiment of a kit is a dispenser designed to dispense the daily doses 
one at a time in the order of their intended use. Preferably, the dispenser is equipped with a 
memory-aid, so as to further facilitate compliance with the regimen. An example of such a 
10 memory-aid is a mechanical counter that indicates the number of daily doses that has been 
dispensed Another example of such a memory-aid is a battery-powered micro-chip memory 
coupled with a liquid crystal readout, or audible reminder signal which, for example, reads out 
the date that the last daily dose has been taken and/or reminds one when the next dose is to be 
taken. 

15 

EXAMPLES 

Example 1-EXAMPLE of a SARS VIRUS ISOLATE 

A SARS virus was isolated from clinical specimens of a patient in Frankfurt, Germany 
(FRA). The isolate was grown in Vero cells. RNA of the SARS virus was extracted and: 
20 amplified by RT-PCR. Nucleotide sequence of the viral genome was determined by direct 
sequencing of the PCH product Computer analysis was used to predict the features of the 
genome, to compare it to previously known coronaviruses and to the sequence of different SARS 
virus isolates. 

More specifically, isolation and sequence was performed as follows. After the third 
25 passage of the SARS virus in Vero cells, viral particles were purified by ultra centrifugation 

from 3xl0 7 cells supernatant Viral RNA was extracted by Triazol method (Gibco-BRL). Viral 
RNA (200 ng) was transcribed into cDNA with avian RNaseH- thermostable reverse 
transcriptase following the instructions of the manufacturer (ThermoScript RT System, 
Invitrogen). Briefly, either 50 pmoles of oligo (dTho (SEQ ID NO: 7389) or 25 ng of random 
30 hexamers were used to prime the RT reaction in a 20 fil final volume. Amplification and 
sequencing of the SARS genome were accomplished by direct sequencing of PCR products 
obtained with: i) specific primers from conserved regions of homology found through multiple 
alignment among known coronaviruses; ii) oligonucleotides designed around short sequences of 
SARS isolates available on the Web through WHO network laboratories; iii) degenerate primers 
35 to amplify the cDNA mixture with multiple overlapping fragments as end products. Gap closure 
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was realized by long distance PCR with high fidelity Taq (Expand High Fidelity system, Roche) 
using primers designed on selected fragments. Sequence was collected by primer walking using 
a BigDye terminator chemistry (Applied Biosystems) and an automated DNA sequencer (3700 
capillary model, Applied Biosystems). After obtaining a first pass of the entire genome, a set of 
5 both forward and reverse primers were used to amplify and sequence de novo the genome using 
as a template DNA segments of 2 kb on average. Readings from overlapping fragments were 
automatically assembled by AutoAssembler (Applied Biosystems) and the 29,740 bp contiguous 
edited manually. 

Computer analysis of the sequence was performed as follows. The GCG Wisconsin 
10 Package suite (version 10.0) was used for computer analysis of gene and protein sequences. Hie 
PSORT program (http://psorLnibb.acjp/) was used for localization predictions. For secondary 
structure analysis, the PHD software available on the Web at 

http://cubiabioacolumbia.edu/prefa was applied The PSI-BLAST algorithm was used for 

homology searches (http://www.ncbinlm.nihgovftlast) using the non-redundant protein database. 

15 ClustalW was applied to obtain multiple sequence alignments of gene and protein sequences. 
The LearnCoil-VMF program was used to predict coiled-coil regions in the spike proteins 
(http://lwrncoU-mtf.lcs.miL Leucine zippers were predicted with the program 2ZIP, 

available at http://2Zip.molgenjnpg.de. 

Phylogenetic analysis was performed using the neighbor-joining algorithm as implemented 

20 in the program NEIGHBOR within the Phylogeny Inference Package (Phylip) (Felsenstein J 
1993, program distributed by the author). Bootstrap analysis was always performed with 100 
replicates using the program Seqboot Trees were handled and displayed using TreeView. The 
program HMMER was used to generate sequence profiles from multiple sequence alignments of 
the S 1 domains of spike proteins. Subsequently, the HMMPFAM program was used to compare 

25 the SI domain of SARS spike to the profiles. 

The genome of this SARS virus isolate is 29,740 bases long and the overall structure of the 
genome is similar to that of the three known groups of corona viruses. Starting from the 5 1 end a 
leader sequence, an untranslated region (UTR) and two overlapping open reading frames coding 
for one polyprotein containing the enzymes necessary for replication can be identified. They are 

30 followed by a region coding for the spike (S), envelope (B), matrix (M), nucleocapsid (N) 

structural proteins and eight additional ORFs specific for the SARS virus. At the 3-end of the 
genome a UTR with a poly(A) is located. The overall homology to coronaviruses groups 1, 2 
and 3 is low and therefore the SARS virus belongs to a new group (group 4) of coronavirus. 
More detailed analysis of the spike protein amino acid sequence shows that the SARS virus 

35 isolate is more closely related to coronavirus group 2. 
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The complete genome sequence of the SARS virus isolate is 29,740 bp in length. Hie 
sequence is available on Genbank and has a GC content of 40.8%, comparable with that of 
known viruses of the same family. Genome structure is similar to that of other coronaviroses. 14 
open reading frames have been predicted. Hie principal features of the genome and gene 
5 products are illustrated reported in Figure 17 and Table 10. The comparison between the SARS 
genome and those of group 1, 2 and 3 coronaviruses is reported in Figure 18. 

Nucleotides 1-73 contain a predicted RNA leader sequence followed by an untranslated 
region (UTR) of 197 nucleotides. Hie UTR is followed by two overlapping open reading frames 
(ORFla, ORFlb), which encompass two-thirds of the genome (nucleotides 265-21485). They 

10 encode for a large polyprotein, which is predicted to be processed by viral proteases to generate 
the replicase complex. The 3' part of the genome contains the genes coding for the four 
structural proteins (S, spike protein, E, envelope protein, M, matrix glycoprotein, and N,* 
nucleocapsid protein), and eight predicted ORFs of unknown function (Figure 17). Finally, at the 
3* end of the genome, we found a second UTR o!340 bases followed by a poly(A) tract We 

15 identified a putative intergenic (IG) sequence also referred to as transcription-associated 

sequence (TAS), which is a typical feature for coronaviruses. The IG sequence is characterized 
by 6-18 nucleotides present at the 3" end of the leader and can be found in front of each gene. 
The IG sequence plays a key role in RNA transcription and its regulation. The IG sequence of 
the SARS virus is characterized by the sequence SEQ ID NO: 7293 and is present nine times in 

20 the genome (Figure 17). The sequence of the leader and IG are peculiar for each coronavirus and 
represent a specific signature for the virus. 

The Replicase Region 

The replicase gene, ORFlab (SEQ ID NO: 7232), consists of two overlapping ORFs, 
ORFla and ORFlb, which can be translated as a single polyprotein by frame shift of the 
25 ribosome in position 13,393, within the polymerase encoding region. See Brierley et al, Embo J 
1987: 6(12): 3779-3785. As expected, a stem-loop sequence is present ten base pairs 
downstream of this site (SEQ ID NO: 7390; , 

S'-^XXjTGTAAGTGCAGCCCGTCTTACACCG-S^. The polyprotein is cleaved co- and/or 
post-translationally into multiple proteins by its own encoded proteases. Using the cleavage 

30 consensus sequence and by analogy with other coronaviruses, we have mapped the possible 
cleavage sites of the polyprotein and have identified 14 products, which comprise the leader 
protein p28, the homologue of the MHV p65 protein and other twelve proteins, named from nspl 
to nspl3 (nsp, non structural protein) (Figure 17 and Table 10). The amino acid sequence 
analysis suggests the presence of several functional motifs within the putative ORFlab proteins. 

35 In particular, we have mapped two potential proteases (nspl and nsp2), one growth factor-like 

motif (nsp7) within ORFla, whereas in ORFlb we identified the RNA polymerase (nsp9), and a 
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predicted helicase (nsplO). The other predicted cleavage products (nsp3, nsp4, nsp5, nsp6, 
nspl 1, nspl2 and nspl3) are proteins of unknown function. Many of these proteins are 
presumably present in the RNA replication complex, which is associated with the membranous 
structures in the infected cells. In particular, nsp3 and nsp4 contain hydrophobic domains. As 

5 shown in Figure 18, the replicase region of SARS has a similar organization to group 1 , 2 and 3 
coronaviruses; however, the overall aminoacid conservation is low (Table 1 1). The most 
conserved proteins are the polymerase and the helices. 

Nspl is the papain-like cysteine protease (PLP), which cleaves the first two protein 
products (leader protein p28 and p65 homologue). Within the nspl of MHV, two domains with 

10 papain-like protease activity (PLP1 and PLP2) have been mapped, (Kanjanahaluethai et al 
(2000) /. Virol 74(17):791 1-21) which are also conserved with Bovine, transmittable 
gastroenteritis virus (TGV) and Human 229E coronaviruses. However, by sequence alignment 
with the SARS nspl, we identified only one PLP domain containing the catalytic residues 
Cys833andHis994. 

15 Nsp2 is the chymotrypsin-picomavirus 3C-like protease (3CLp), which is responsible for = 

the post-translational processing of the other 12 proteins, most of them cleaved at Q/A or Q/S 
sites. (Ziebuhr etal (1999) /. Virol 73(1): 177-85). It also performs autoproteolytic activity. The 
principal catalytic residues are well conserved with other coronaviruses and are located at 
position His41 and Cysl45. Furthermore, even the conserved aminoacids Tyrl61 and Hisl63, 

20 which are believed to be involved in substrate recognition and to be indispensable for proteolytic 
activity, (Hegyi et al (2002) /. Gen Virol 83(Pt3):581-593) were found in the sequence of .the 
SARS 3CLp. 

The invention includes the orflab sequence of SEQ ID NO: 9960 and die orfla sequence 
of SEQ ID NO: 9961, including fragments, variants, homologs, etc. thereof. 

25 The Structural Reidon 

Analysis of the nucleotide sequence at the 3' part of the SARS genome identified 12 
predicted open reading frames. They are coded within 8.2 kb and comprise the four structural 
proteins S, E, M and N, common for all coronaviruses and eight predicted ORFs, which are 
specific for this virus (Figure 18). SARS-specific IG sequences upstream of most ORFs (Figures 
30 17 & 18) suggest that most genes are likely to be transcribed independently. Interestingly, 

sequences identical to the group 2 IG are also present at the end of the RNA leader and in front 
of the Matrix encoding gene and of ORF 10. 

The spike is a type I glycoprotein, which forms the large spikes on the surface of the virion 
and is responsible for receptor-binding and membrane fusion. (Gallagher (2001) Adv Exp Med 
35 Biol 494: 183-92). The protein is 1255 residues long with 17 predicted N-glycosylation sites. It 
has a 13aa leader peptide and a 17 aa C-terminal membrane anchoring sequence (1202-1218). 
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Some (MHV, HCoV-OC43, AIBV and BCoV), but not all (TGV, FEPV, HCoV-229E) 
coronaviius spike proteins are proteolytically cleaved in two subunits, SI and S2. SI is 
supposed to form the bulbous head, which stays non-covalently linked to the C-terminal 
membrane anchor. Cleavage is mediated by a basic aminoacid sequence, which resembles the 
5 consensus sequence for a furin cleavage site. (Garten et.aL, Biochimie 1994; 76(3-4): 217-225). 
However, in case of this SARS virus isolate, we were not able to identify such a sequence, 
implicating that the S protein of this SARS virus isolate is unlikely to be cleaved during 
maturation. Secondary structure predictions indicated that the global architecture of the spike 
protein is conserved within all known coronaviruses. The S 1 domain is mainly formed by beta 

10 sheets and likely adopts a globular fold, while in the S2 domain extensive alpha helical regions 
are predicted. In addition, the LearnCoil-VMF program, specifically designed to identify coiled- 
coil-like regions in viral membrane-fusion proteins, predicts two coiled-coils within S2, spanning 
aminoacids 900-1005 and 1151-1185, respectively (Figure 19). Both coiled-coil regions contain 
a leucine-zipper motif, which is also present in the spikes of all coronaviruses. Leucine zippers 

15 are known to promote protein oligomerization; since the spike proteins of TGV and MHV form 
hetero-trimers, (Delmas etal.J Virol 1990; 64(ll):5367-5375) (Godeke, etal.,J Virology 2000; 
74(3):1566-1571) it is conceivable that in SARS leucine zippers play a role in promoting and/or 
stabilizing a similar quaternary structure. The spike protein plays a major role in the biology of 
coronaviruses because the SI domain contains the receptor-binding domain and the virus 

20 neutralizing epitopes, while the S2 domain is involved in the process of membrane fusion, which 
is essential for virus infectivity. As expected, multiple sequence alignment of different spike 
proteins showed a major degree of variability within the SI domain, whereas S2 is more 
conserved. 

The envelope protein E is a very short polypeptide of 76 aa, involved in the morphogenesis 
25 of the virion envelope. (Godet et cd., Virology 1992; 188(2):666-675). Computer analysis 
predicts a long transmembrane domain close to the N-terminus and two N-glycosylation sites. 
The level of aminoacid similarity with other coronaviruses is very low and the best homology is 
with the small envelope protein of the transmissible gastroenteritis virus (TGV). 

The matrix glycoprotein (M) is a 221-residue polypeptide with a predicted molecular 
30 weight of 25 kDa. Computer analysis predicts a topology consisting of a short aminoterminal 
ectodomain, three transmembrane segments and a carboxyl terminus located at the interior side 
of the viral envelope. In analogy with the matrix glycoprotein of TGV, that of the avian infective 
bronchitis virus (AIBV) and that of the hypervirulent MHV-2 strain the SARS M glycoprotein is 
N-glycosylated at the N-terminus. SARS M protein shows highest similarity to group 2 viruses 
35 (Table 11). 
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Finally, the nucleocapsid protean N is a 397-residue-long phosphoprotein that interacts 
with viral genomic RNA to fonn the nucleocapsid llie level of conservation with other 
coronaviruses is low, ranging from 26,9% identity with the HCoV-229E to 37,4% identity to the 
Bovine coronavims (BcoV) (Table 11). Epitope analysis of the nucleocapsid protein has been 
5 earned out (Li et aL (2003) Geno Prot & Bioinfo 1:198-206) in which the epitope site at the C 
terminus of the protein was located as SEQ ID NO: 7394 (amino acids 371-407 of SEQ ID NO: 
6052). 

In addition to the above fundamental proteins, many viruses express a set of other peptides, 
which are generally dispensable for viability, but can influence the infectivity potential of the 

10 virus, (de Haan et aL, Virology 2002; 296(1): 177-189). These proteins are generally conserved 
within members of the same serogroup, but differ profoundly among the groups. For this reason, 
they are generally referred to as group-specific proteins (Figure 11). Members of the group 1, 
represented here by HcoV-229E, have two group-specific genes located between the S and E 
genes and sometimes one or two ORFs downstream of the N gene, preceding the 3' UTR region 

15 of the genome. Viruses of the group 2, with MHV as prototype, have two group-specific genes 
(2a and HE) between ORFlb and S, as well as other two between S and E genes. Finally, the 
group 3 viruses, represented by the prototype AIBV, have two group-specific genes between S 
and E and other two between the M and N genes. 

. With the exception of the hemagglutinin esterase HE, for which hemagglutinating and 

20 acetyl-esterase enzymatic activities have been demonstrated, all the other group-specific ORFs 
encode proteins whose role has not yet been established 

Interestingly, the arrangement of specific genes in the SARS genome is peculiar and the 
predicted ORFs do not display any significant homology with ORFs present in the other 
coronaviruses, nor with any other known protein from different organisms. Like viruses of the 

25 group 1 and 3, SARS lacks the HE hemagglutinin and does not contain ORFs between the 

ORFlb and the S gene. Furthermore, two predicted ORFs (ORF3 and ORF4) are encoded in the 
region between S and E, and superimpose for most of their length. ORF3 has an IG sequence 2 
bp upstream of the ATG start codon. In contrast to the other groups, SARS contains five 
predicted ORFs in the region between M and N genes. ORF7 is located 10 bases downstream of 

30 the stop codon of M gene, and has an IG sequence 155 nucleotides upstream from the ATG start 
codon. Similarly, ORF8 and ORF10 present an IG right upstream of their ATG start codons. On 
the other hand, the 5' ends of ORF9 and ORF11 shortly superimpose with the flanking genes, 
and for this reason they do not need an IG to activate transcription. ORF12 totally superimposes 
with the N gene and shares very low homology with a 22kDa protein of the MHV virus, coded in 

35 the corresponding region. 
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Despite the absence of indications of possible localization and function deriving from 
sequence similarity, ORF3, ORF7 and ORF8 contain hydrophobic segments, suggesting 
association with membrane structures. In addition, ORF3, the longest among the SARS specific 
gene, is the only one that encodes for a peptide containing a high number of predicted O- 
5 glycosylation sites (Table 1 1). Predicted N-glycosylation sites have been identified in ORF3, 
ORF1 1 and ORF12. 

Two shorter ORFs in the non-structural regions are SEQ ID NOS: 9965 and 9966. The 
invention includes polypeptides with these sequences, and also fragments, variants, etc. 

Phvlozenetic analysis 

10 The substitution frequency within 922 conserved bases from the pol gene of eleven 

coronaviruses from the three different serogroups has been used in the past to show that the 
variability within members of each serogroup is much smaller than between members of 
different serogroups, confirming the previously described serological groupings. (Stephensen et 
a/., Virus Res 1999; 60(2):181-9). We used the 922 bp region of the pol gene of SARS and 

15 aligned it with the same fragment from other 12 coronaviruses. The tree obtained showed that 
the SARS virus is distinct from the other three groups of coronaviruses (Figure 20). Similar 
results were obtained using the full-length aminoacid sequences of pol, 3CL-protease and 
helicase from the replicase region and those of the spike and the matrix glycoproteins from the 
structural region (data not shown). These data confirmed that the entire genome of the SARS 

20 virus clusters in a new group (group 4) of coronavirus. 

To gain more resolution for possible evolutionary relationships we performed the analysis 
using consensus Sequences of predicted domains of the proteins. In particular, we generated 
consensus sequences of the SI domain of the spike protein from the group 1 and group 2 and 
then we compared them to the SI domain of the SARS spike. No consensus could be generated 

25 from the group 3 since only the spike protein of AIBV is known. Interestingly, the tree 

constructed from the alignment of SARS SI with the consensus generated from the two groups 
of spike proteins was different from that in Figure 20, and showed a much closer relationship 
between SARS and group 2 coronaviruses (Figure 21A). Further analysis showed that 19 out of 
the 20 cysteines present in the SARS SI domain are spatially conserved with the group 2 

30 consensus sequence, while only five are maintained either within the group 1 and group 3 

sequences (Figure 21B). Given the fundamental role played by cysteines in protein folding, it is 
likely that the SI domain of SARS and group 2 coronaviruses share a similar spatial 
organization. 
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Sequence variability between SARS coronaviruses 

We compared the FRA sequence to the four complete SARS genomes available on the 
Web. A total of 30 mutations were detected Nine of these mutations were silent while 21 
resulted in aminoacid substitutions (Table 12). Within ORFla, three silent and seven productive 
5 mutations were detected In ORFlb, there were five silent and three productive mutations. One 
of the productive mutations was caused by two nucleotide substitutions resulting in a single 
aminoacid change. Five changes were located in the spike protein, four of these were productive 
and one silent Two productive mutations were in ORF3 and in the matrix glycoprotein M. One 
productive mutation each was in ORF10 and in the nucleocapsid protein N. 
10 Hie overall difference between FRA and TOR2 was of nine nucleotides resulting in two 

silent mutations and seven aminoacid changes. Hie difference between FRA and Urbani is 12 
nucleotides, which result in five silent mutations and seven aminoacid changes. For CUHK 16 
nucleotides were different, five of which were silent mutations. For FRA and HKU 14 nucleotide 
changes resulted in four silent and nine productive mutations. 

15 EXAMPLE 2 -Production, Inactivdtion and Purification of Whole SARS Virus Using MCS 
Chromatography Resin Purification Followed by Density Gradient Ultracentrifugation 
A SARS isolate FRA1 (EMBL: AY3 10120) was passaged on VERO cells that were 
cultivated in DMEM (Gibco: Cat No. 21969-035, Lot No. 3078864), Penicillin/Strep (Gibco: Cat 
No. 15070-063, Lot No. 1120042), and 3% FCS (Gibco: Cat No. 10270-106, Lot No. * 

20 40F6130K) at 37°C, 5% C0 2 . Trypsin (Gibco: Cat No. 25300-054, Lot No. 3078729) was used 
for detaching the cells. * 

For virus production the third passage was used for inoculation of VERO cells at a moi of 
-0.1. Cells were incubated with the virus for 1 h at 37°C in infection medium (DMEM without 
PS, FCS); after Ih cells were washed twice and further incubated at 37°C for 48 h in the presents 

25 of 3% FCS and antibiotics. The supernatant was harvested 48 hours post infection (p J.) and 
precleared by centrifugation at 3000 rpm at 4°C for 10 min. 

The SARS virus was inactivated by p-propiolactone (BPL) treatment (1:2000) for 18 h at 
4°C, followed by 3 h at 37°C. Testing the virus on successful inactivation, VERO cells were 
incubated with 10 ml BPL treated supernatant for 4 days at 37°C; subsequentially, the 

30 supernatant was transferred to a fresh VERO cell culture and further incubated for another 4 
days. Cells wore checked for cytopathic effect (CPE). 

200 ml of the BPL-inactivated SARS virus harvest was then clarified using a 0.65 fim- 
pore-size filter (47 mm diameter) to pass virus particles and retain cell debris. The filter unit was 
connected to a Masterflex pump, which accomplished a consistent flow rate of 40 ml/min. 
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A. MCS Ckromatozrv phv Purification Step 

The filtered virus suspension was then subjected to MCS chromatography. Hie MCS 
column was prepared as follows. 27 ml slurry led to 14 ml sedimentated resin which was packed 
using a Gotec Superformance Column (diameter 1.0 cm, height 15,7 cm, volume 12.33 ml). 1% 
5 of the column volume of a 1% acetone solution was injected to the column and the column was 
run with a flow of 100 cm/h. Hie HETP, N and As values were thai calculated as HETP: 0,056 
cm, N / m: 1790 and As = 1.20. 

Hie amount of proteins in the purified solution after the MCS chromatography step were 
assessed with a bicinchoninic acid (BCA) method Qhterchim) {see, e.g., 
10 http://www.pkrceneLcom/fUes/baLpdft and electrophoresis. 



SDS-PAGE was done in accordance to Laemmli, Nature (1970) 227:680-685. Samples for 
SDS-PAGE were diluted to a protein concentration of 77 /ig/mL Different protein 
concentrations, were loaded depending on the gel types used (10/12/15 Wells, Novex/Ihvitrogen): 



Number of Wells 


Protein Concentration in the Dilution 


Load 


Protein/Well 


10 Wells 


77/ig/ml 


20 ul 


ins 


12 Wells 


77ug/ml 


15-20 ul 


0.75 - 1 ug 


15 Wells 


77/ig/ml 


10 Ml 


0.5 fig 



15 Samples for use in a reducing SDS-PAGE were prepared as follows: 





26 ill sample or diluted sample 




+ 10/il NuPage Sample Buffer (4x) SDS NP0003 




+ 4 jd TCEP Bondbreaker Solution 77720 
(1:2 in MilliO water) 


Final Volume: 


40 pi 



The samples were heated for 10 minutes at 70°C or left at room temperature for 1 hour 
(leaving the samples at room temperature prevents the M protein of Corona Virus to 

ID coagulate/forming complexes), and then centrifuged for approximately one minute at 14,000 rpm 
in a table top centrifuge. 

Markers for use on the gel were prepared as follows. Gel bands containing less than 1 /ig 
of proteins were easily visualised with the silver staining procedure using the Silver Staining Kit 
Protein, Plus One Staining Protocol (Pharmacia Biotech). 

15 Western blotting was performed as follows. A semi-dry blotting technique was used to 

transfer the proteins from the SDS gel to a nitrocellulose membrane. The transfer was performed 
with a current of 0.8 mA/cm 2 for 1 hour. A rabbit polyclonal antibody against SARS vims was 
used to perform the immuno probing using the Western Breeze, Novex Chromogenic Western 
Blot Immunodetection Kit (Novex/Ihvitrogen). 

}0 The chromatogram of the inactivated SARS MCS capture step is depicted in FIGURE 27. 

To estimate purity, MCS chromatography fractions were analysed by silver staining on NuPage 
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10% or 4-12% Bis-Tris-Gel (Novex) under reduced conditions, heated for 10 minutes at 70°C 
(Figure 28). The fractions were also analysed under the same conditions by western blot (Figure 
29) to estimate purity, using PAK 11/03 SARS Cov 270603 neutralizing titer 1:512 (this 



antibody was used for this and subsequent western blots). Purity estimates are as follows: 



Sample 


yolume 
/ml 


[Protein] 
/US/ad 


Total Protein 
/ IDE 


Step Recovery 
Protein/ % 


Corona Harvest 


100 


2547.6 


254.76 


100 


After Filtration = Load 


100 


2440.3 


244.03 


95.8 


Flow Through 


85 


2321.4 


197.32 


77.5 


Wash 


49.32 


468.5 


23.11 


9.1 


Peakl 


12.12 


252.7 


3.062 


12 


Total Recovery 






464.4 


86.5 



5 

B. Density Gradient Ultracentrifueation Step 

The eluted SARS virus fraction was then subjected to density gradient ultracentrifugation 
with a swinging bucket rotor to further purify the inactivated virus. 3 ml of MCS peak fraction 
wereloaded onto a linear gradienT(i5-60% sucrose; 17 ml 15% and 17 ml 60% sucrose in 
10 gradient mixer). The separation was performed with a Beckman SW 28 rotor at 20,000 rpm for 
2 hours. 

The content of sucrose and protein in the. linear density gradient ultracentrifugation 
fractions are depicted in the following table, the graph in figure 30 and the estimation of purity in 
figure31: 



Fraction 


Fraction Size /ml 


[Sucrose] /% 


[Protein] //ts/ml 


1 


2 


61 


96.12 


2 


2 


59.4 


98.62 


3 


2 


57.5 


87.63 


4 


2 


54.5 


86.91 


5 


2 


50.5 


79.9 


6 


2 


47.2 


74.3 


7 


2 


43.7 


68.05 


8 


2 


40.2 


60.43 


9 


2 


37.2 


57.38 


10 


2 


34 


53.12 


11 


2 


30 


50.63 


12 


2 


25.7 


35.02 


13 


2 


22.4 


35.33 


14 


2 


19.5 


39.25 


15 


2 


15.5 


69.79 


16 


2 


8.5 


169.03 


17 


2 


8.5 


128.96 



15 

The protein concentration of fraction 11 (Figure 31 SDS-gel) was measured again against a 
standard curve prepared in 30% sucrose and lead to a protein concentration of 3.67 /xg/ml (0.05 
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fig on the gel). The M protein appears to be missing in this preparation possibily due to sample 
treatment procedure (heated samples). 

There may be discrepancies in the protein concentration measurements in Table 2 due to 
sucrose interference with this assay. 

5 EXAMPLE 3 -Production, Inactivation and Purification of Whole SARS Virus Using MCS 
Chromatography Resin Purification Followed by Density Gradient UUracentrifugation 

Inactivated SARS virus was prepared as described in Example above. 
A MCS Chromatography Purification Step 

In this example, 200 ml of inactivated SARS virus harvest were subjected to MCS 
10 chromatography. The chromatogram of the capture step of inactivated SARS virus purification 
with MCS is depicted in FIGURE 32, the protein recovery in the following table and the 



estimation of purity in FIGURE 33: 





Volume 


[Protein] 


Total Protein 


Step Recovery 


Sample 


/ml 


/aetail 


/mg 


Protein/ % 


Corona Virus Harvest 


200 


2239.2 


447.83 


100 


After Filtration = Load 


200 


2245.1 


449.02 


100.3 


Flow Through 


185 


2126.3 


393.37 


87.8 


Wash 


49.32 


450.1 . 


22.2 


5.0 


Peakl 


4.43 


1245.6 


5.52 


L2 


Total Recovery 






421.08 


93.7 



B. Density Gradient Ultracentrifuzaiion Step 
15 3.5 ml of MCS peak fraction were then loaded onto a linear gradient (15-40% sucrose: 16 

ml 15% and 16ml 40% sucrose in gradient mixer). The separation was performed with a 
Beckman SW 28 rotor at 20,000 rpm for 2 hours. 

The content of sucrose and protein in the linear density gradient ultracentrifugation 
fractions are depicted in the following table and the graph in FIGURE 34: 



Tube 


Fraction Size / ml 


[Sucrose] / % 


[Protein] / /tg/ml 


1 


2 


40 


45.86 


2 


2 


39 


45.68 


3 


2 


37.5 


44.14 


4 


2 


35.5 


37.82 


5 


2 


33.5 


34.48 


6 


2 


31.5 


31.76 


7 


2 


30.5 


29.49 


8 


2 


28 


30.87 


9 


2 


25.5 


31.7 


10 


2 


23.5 


26.74 


11 


2 


21.75 


. 23.58 


12 


2 


20 


35.33 


13 


2 


18 


96.38 


14 


2 


14.5 


523.79 
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15 


2 


8 


941.97 


16 


2 


8 


696.7 



Protein recovery is shown in the following table and the estimation of purity is shown in figure 
35. Electron Micrograph pictures of density gradient fractions 8, 9 and 10 are shown in figure 



36: 



Step 


Volume /ml 


Protein / fig/ml 


Total Protein /mg 


Step Protein % 


Load 


3.5 ml 


1245.6 


4359.6 


100 


Bulk Protein Fractions 


3.5 ml 


720.8 


4324.9 


99.2 


Viral Peak Fraction 


8ml 


29.7 


237.6 


SJS 


Total Recovery 






4562.5 


104.7 



5 

EXAMPLE 4 - Mouse Immunization with Inactivated SARS Virus 

Mice were imm unize d subcutaneously on days 0, 14, and 28 with 5 fig BPLrinactivated 
S ARS-CoV particles (BPL-S ARS-GoV), either alone or together with Alum or MF59 as 
adjuvants. Serum was collected on days 0 (pre-immunization), 13 (post 1st immunization), 28 

10 (post 2nd), and 35 (1 week post 3rd immunization). Neutralizing antibodies were assessed for 
blocking SARS-CoV infection of Vero cells in vitro. After 3 immunizations, neutralization titers 
were in the range 1:100-1:1000, which are levels similar to those present in the serum of SARS 
convalescent patients. As shown in the following table, the non-adjuvanted vaccine induced 
neutralizing antibody after the third immunization, and potency of this vaccine was enhanced 

15 significantly by including the adjuvants, with neutralizing antibody appearing after then 2nd 
immuniz ation and overall titers increasing after then 3rd immu nization: 



Immunogen 


Neutralization Titer 


pre 


post 1st 


post 2nd 


post 3rd 


BPL-S ARS-CoV+MF59 (5 Mg) 


<1:20 


<1:20 


1:158 


1:630 


I BPL-S ARS-CoV+Alum (5 MB) 


<1:20 


<1:20 


1:67 


1:612 


BPL-S ARS-CoV (5 /ig) 


<1:20 


< 1:20 


<1:20 


1:71 


PBS 


<1:20 


<1:20 


<1:20 


<1:20 



EXAMPLE 5 - BaJb/cMouse Immunization with Inactivated SARS Virus 

A Balb/c mouse model for SARS infection has been developed (Subbarao et aL (2004), 
20 J.ViroU 78:3572-77. In this model, Balb/c mice are inoculated intranasally with 10 4 TCID50 of 
virus. At 48 hours post-inoculation, a 2-log increase in the TCID50 virus titer can be detected in 
the lungs of infected mice. While virus replication is readily detected, the mice do not show any 
SARS disease symptoms and spontaneously clear the virus one week after inoculation. A 
decrease in virus titer in previously-immunized animals as compared to control animals 
25 demonstrates a protective effect of the vaccine being evaluated. 

In this example, four Balb/c mice per group are immunized three times with 5 ng BPL 
inactivated S ARS-CoV (days 0, 14, 28) either alone or in combination with MF59 and 
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challenged with 10 4 TCID50 of SARS-CoV on day 43. Two days following vims challenge the 
mice are euthanized and SARS-CoV is quantified from nasal turbinates (NT) and lungs and the 
mean virus titer for each mouse is measured. Control groups received PBS alone, or an influenza 
virus vaccine (FLU) with or without MF59 adjuvant Data were as follows (see also Figure 51), 
5 where four mice were tested per group and virus titers are expressed as logio TCID50 per gram of 
tissue: 



Immunogcn 


Virus replication in lungs of 
challenged mice 


Virus replication in nasal 
turbinates of challenged mice 


# infected/ 
# tested 


Mean (± SE) 
virus titer 


# infected/ 
# tested 


Mean (± SE) 
virus titer 


PBS 


4/4 


6.3 ±03 


3/4 


2.8 ±035 


MF-59 alone 


4/4 


6.1 ±0.13 


3/4 


3.0 ±0.38 


FLU vaccine (5 /ig) 


4/4 


6.3 ±0.07 


3/4 


2.9 ±0.36 


FLU vaccine (5 fig) + MF-59 


4/4 : 


6.0 ±0.19 


4/4 


3.0 ±0.11 


BPL-SARS-CoV (5 lip) 


»4 


1.6 ±0.13* 


0/4 


Not detected ** 


BPL-SARS-CoV (5 jig) + MF-59 


" 0/4 


Not detected* 


0/4 


Not detected ** 



Two-tailed Student's t-test, compared toTBS-immunized mice, showed: * P<0.00001 or ** P=0.025 



. As shown, virus could not be detected in the BPI^SARS-CoV immunized mice. The lower 

10 limit of detection of infectious virus in a 10% w/v suspension of lung homogenate was 1.5 
logioTCD5o/gm, and in a 5% w/v suspension of nasal turbinates the limit was 1.8 
logioTCIDso/gm. Viral titers in the immunized mammals were thus below these threshold values. 

Thus the inactivated SARS-CoV vaccine was very efficient at preventing virus infection, 
as only one of eight mice immunized with the vaccine, either with or without MF59 adjuvant* 

15 was infected. Similar protection was not observed in control groups of PBS diluent, MF59 
adjuvant, or influenza virus vaccine with or without adjuvant 

Neutralization titers of sera taken from the animals in the challenge study were assessed at 
two weeks post-lst, one week post-2nd, and one week post-3rd immunization. Mice immunized 
with the vaccine with MF59 adjuvant had already developed a neutralization titer of 1:71 after 

10 the 2nd immunization, which increased to 1:588 after the 3rd immunization, whereas mice 
receiving the unadjuvanted vaccine did not have any neutralizing activity post-2nd and a 
neutralization titer of 1:64 post-3rd immunization. Sera from mice in each of the control groups 
did not show any neutralization activity. These data clearly demonstrate not only the ability of 
the inactivated SARS-CoV vaccine to induce protective levels of S ARS neutralizing antibodies, 

15 but also a beneficial effect of formulating the vaccine with adjuvant for elevated neutralization 
titers. 

EXAMPLE 6 - Preparation of OMV comprising SARS viral antigens 

Kcoli were transfected with a plasmid of interest (encoding a SARS viral antigen). Single 
colonies harbouring the plasmid of interest were grown overnight at 37°C in 20 ml of LB/Amp 
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(100 /ig/ml) liquid culture. Bacteria were diluted 1:30 in 1.0 L of fresh medium and grown at 
either 30°C or 37°C until the OD550 reached 0.6-0.8. Expression of recombinant protein was 
induced with IPTG at a final concentration of 1.0 mM. After incubation for 3 hours, bacteria 
were harvested by centrifugation at 8 000 x g for 15 minutes at 4°C andresuspendedin 20 ml of 

5 20 mM Tris-HCl (pH 7.5) and complete protease inhibitors (Boehringer-Mannheim™). All 
subsequent procedures were performed at 4°C or on ice. 

Cells were disrupted by sonication using a Branson Sonifier 450 and centrifuged at 5 000 x 
g for 20 min to sediment unbroken cells and inclusion bodies. The supernatant, containing 
membranes and cellular debris, was centrifuged at 50000g (Beckman Ti50, 29 000 rpm) for 75 

10 min, washed with 20 mM Bis-tris propane (pH 6.5), 1.0 M NaCl, 10% (v/v) glycerol and 

sedimented again at 50000g for 75 minutes. The pellet was resuspended in 20mM Tris-HCl (pH 
7.5), 2.0% (v/v) Sarkosyl, complete protease inhibitor (1.0 mM EDTA, final concentration) and 
incubated for 20 minutes to dissolve inner membrane. Cellular debris was pelleted by 
centrifugation at 5000g for 10 min and the supernatant centrifuged at 75000g for 75 minutes 

15 (Beckman Ti50, 33000 rpm). Outer membrane vesicles were washed with 20 mM Tris-HCl (pH 
7.5) and centrifuged at 75 000 x g for 75 minutes or overnight The OMV was finally 
resuspended in 500 fil of 20 mM Tris-HCl (pH 7.5), 10% v/v glycerol. Protein concentration 
was estimated by standard Bradford Assay (Bio-Rad), while protein concentration of inner 
membrane fraction was determined with the DC protein assay (Bio-Rad). Various fractions from 

20 the isolation procedure were assayed by SDS-PAGE. 

EXAMPLE 7- Immunogenicity, dose and route schedule for recombinant Spike protein in 
mice 

The immunogenicity, route and dosing of the recombinant spike proteins of the invention 
in mice may be assessed using the below detailed protocol. Preferably, the administered antigen 
25 will elicit neutralizing antibody titers at least in the range of 1/100-1/1000. Increasing doses of 
antigen can be tested in the range from 5 to 20 fig of recombinant Spike antigen alone or mixed 
with an equal volume of MF59-citrate, administered SC or IM to anesthetized mice in 100 |jJ of 
inoculum. Groups of BALB/c mice, 6 per treatment are primed at day 0 and boosted at day 14 
and 28. 
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Group 


Treatment 


Dose/Route 


Sampling interval 


Number of mice 


1-3 


Rec-Spike protein 


20,10,5//g/SC 


7, 21, 35, 42 d 


6 per dose level 


4-6 


Rec-SpDce protein 


20, 10,5^g/SC 


7 


6 per dose level 


7-9 


Rec-Spike protein 


20,10,5j/g/IM 


| 7, 21, 35, 42 d 


6 per dose level 


10-12 


Rec-Spike protein 


20,10,5//g/IM 


7 


6 per dose level 


13-15 


Rec-Spike - MF59 


20, 10,5//g/SC 


! 7,21,35>42d 


6 per dose level 


16-18 


Rec-Spike - MF59 


20, 10,5//g/SC 


7 


6 per dose level 


19-21 


Rec-Spike - MF59 


20, 10,5//g/IM 


7, 21, 35, 42 d 


6 per dose level 


22-24 


Rec-Spike - MF59 


20, 10,5//g/IM 


7 


6 per dose level 


25 


MF59 


NA/SC 


7, 21, 35, 42 d I 


6-»-6 (sac d 7 and 42) 


27 


MF59 


NA/IM 


7, 21, 35, 42 d j 


6 + 6 (sac d 7 and 42) 


29 


Saline 


NA/SC 


7, 21, 35, 42 d 


6 + 6(sacd7and42) 


31 


Saline 


NA/IM 


7, 21, 35,42 d 


6 + 6(sacd7and 42) 



This protocol can also be used to assess the Thl/Th2 profile of the specific immune 
response elicited by the recombinant Spike protein. Neutralizing and Spike-specific antibody 
titers will be assessed at days 7, 21, and 35; IgG2a vs IgGl isotype of the Spike-specific 

5 antibodies will be determined at days 21 and 35; in vitro proliferation of lymph node and splenic 
T cells against the recombinant Spike protein will be determined at days 7 and 42, respectively, 
IFN-7 and IL-4 production by splenic T cell against the recombinant Spike protein from SARS- 
CoV will be assessed at day 42. Peripheral blood will be collected at days 7, 21, 35; lymph 
nodes cells at day 7, and spleen cells at day 42. Neutralizing and Spike-specific antibody titers 

10 and isotypes will be determined by inhibition of SARS-CoV infection of Verb cells and by 

ELBA, respectively. Proliferation of lymph node and splenic cells will be determined by 3 [H]- 
Thymidine uptake. Frequencies of splenic IFN-7 and IL-4 producing T lymphocytes, will be 
determined by ELISPOT and FACS. 

EXAMPLE 8 -lmmunogenicity, dosing and route schedule for Spike proteins in rabbits 
15 The immunogenicity, route and dosing of the recombinant spike proteins of the invention 

in rabbits may be assessed using the below detailed protocol. Increasing doses can be tested in 
the range from 5 to 40 j^g of recombinant Spike antigen alone or mixed with an equal volume of 
MF59-citrate, administered SC or IM to anesthetized animals in 200jxl of inoculum. Groups of 
New Zealand white female rabbits, 10 per treatment, will be immunized as shown in the table 
20 below. The animals will be primed at day 0 and boosted at days 14 and 28. Peripheral blood 
will be collected at days 7, 21, and 35. Neutralizing and Spike-specific antibody titers will be 
determined by inhibition of SARS-CoV infection of Vero cells and by FT ISA, respectively. 
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Group 


Treatment 


Dose/Route 


Sampling interval 


Number of rabbits 


1-4 


Full-length Spike protein 


40, 20,10,5/^/SC 


7, 21, 35 d 


10 per dose level 


5-8 


FulHength Spike protein 


40, 20, 10, £*/g/IM 


7, 21, 35 d 


10perdosetevel 


9-12 


Truncated Spike protein 


40, 20, 10.5//g/SC 


7, 21, 35 d 


10 per dose level 


13-16 


Truncated Spike protein 


40,20,10.5j/g/IM 


7, 21,35d 


10 per dose level 


17-20 


FulHength Spike protein - MF59 


40, 20, 10,5//g/SC 


7,21,35d 


10 per dose level 


21-24 


FulHength Spike protein - MF59 


40, 20, 10, 5yg/lM 


7, 21,35d 


10 per dose level 


25-28 


Truncated Spike protein - MF59 


40. 20, 10,5/yg/SC 


7, 21,35d 


10 per dose level 


29-32 


Truncated Spike protein - MF59 


40,20, 10, 5//g/lM 


7, 21, 35 d 


10 per dose level 


33 


MF59 


NA/SC 


7, 21,35d 


10 


34 


MF59 


NA/1M 


7, 21,35d 


10 


35 


Saline 


NA/SC 


7, 21,35d 


10 


36 


Saline 


NA/IM 


7, 21,35d 


10 



EXAMPLE 9 - Immunogenicity and dose schedule for recombinant Spike in ferrets 

The immunogenicity and dosing of the recombinant spike proteins of the invention in 
ferrets may be assessed using the below detailed protocol. Three groups of ferrets, 6 for 
5 treatment, will be immunized with recombinant SARS-CoV Spike protein from CHO cell lines, 
alone or mixed with an equal volume of MF59-citrate, administered SC to anesthetized animals 
in 200/il of inoculum. Hie recombinant Spike protein vaccine will be tested at the dose eliciting 
the highest neutralizing antibody titers in mice at day 35 after the second boost The animals 
will be primed at day 0 and boosted at day 14 and 28. Peripheral blood will be collected at days 
10 7, 21, and 35. Neutralizing and Spike-specific antibodies titers will be determined by inhibition 
of SARS-CoV infection of Vero cells and by ELTSA, respectively. 



Groups 


Treatment 


Dose/Route 


Sampling Interval 


Number of ferrets 


1&2 


Rec-Spike protein 


Y//gor2Y//g/SC 


7, 21,35d 


6 - ' i 


3&4 


Rec-Spike protein + MF59 


Y//gor2Y//g/SC 


7, 21, 35 d 


6- 


5 


Saline 


NA/SC 


7, 21,35d 


6 



The 3 groups of ferrets, 6 animals per group, used for the immunogenicity studies above 
can then be used to assess efficacy of the recombinant Spike protein in protecting vaccinated 

15 animals from infection and/or disease. Anestethized animals will be challenged two weks after 
the last boost intratracheally with 10 6 median tissue culture infectious dose unit (TCID50) of the 
SARS-CoV Utah strain. Infection by SARS-CoV will be assessed by taking nasal, f aringeal and 
rectal swabs from animals for 20 days after challenge as described (12). The presence of SARS- 
CoV in sample materials will be assessed by RT-PCR and infection assay of Vero cells. 

20 Animals will be monitored for clinical signs of S ARS disease by assessing sleeping time, 
temperature, respiratory symptoms, diarrhea, body weight and survival. Protection will be 
determined by the magnitude and duration of virus shedding and by duration and severity of 
disease symptoms and percentages of surviving animals. 
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EXAMPLE 10: Expression of Spike protein for vaccination 

The SARS-CoV Spike glycoprotein was expressed in both full-length and truncated forms, 
using the nSh and nShATC pCMVm constructs described above, both with hexahistidine tags. 
Hie vector constructs were evaluated for expression 48 hr after transfection into 293 cells and 
5 COS7 cells. The full-length Spike protein (nSh) was detected by western blot only in cell lysate, 
but not in culture media (Rgure 52). 

The majority of SARS-CoV full-length Spike protein was expressed in 
transiently-transfected COS7 cells as a high molecular glycoprotein which ran at 540 kDa in 
non-reducing gels (Figure 53). Hie gp540 is heat labile as indicated by the complete dissociation 

10 into monomeric forms (gpl70 & gpl80) by boiling, but it was resistant to DTT treatment These 
data suggest that the recombinant Spike protein is noncovalently associated into a homotrimer 
(gp540). Hie presence of Spike protein in homotrimeric association also was confirmed in 
inactivated, purified SARS-CoV virion particles. Analysis of virion proteins by western blot 
under the same condition used for the characterization of recombinant Spike protein generated 

15 essentially identical results (Rgure 54). 

EXAMPLE 11: Spike protein processing 

In order to characterize Spike protein processing, BHK-21 cells were infected with 
alphavirus replicon particles expressing the SARS-CoV full-length Spike. At 6 hoursr post- 
infection with an MOI of 5, infected cells were labeled for 1 hr with M^S]methionine/cysteine 

10 and chased for up to 4 hours. The [ 35 S]-labeled spike protein was immunoprecipitated by anti- 
S ARS rabbit serum and digested with Endo-EL Both digested and undigested proteins were 
analyzed by SDS-PAGE (4% polyacrylamide). As shown in Figure 55, the full-length spike 
protein is synthesized as an Endo-H sensitive high-mannose glycoprotein (gpl70, an ER form) 
that undergoes modification to an Endo-H resistant glycoprotein with complex oligosaccharides 

:5 (gpl80, a Golgi form). The conversion of gpl70 into the gpl80 form takes place within 2 hours 
(Figure 56). 

EXAMPLE 12: High-level protein expression 

To develop a system for rapid expression of protein antigens, DNA transfection of 293 
(human embryonic kidney) cells was used, tp obtain milligram quantities of recombinant 
0 antigen. Hie most common method for culturing and transfecting 293 cells is in static or 

monolayer cultures. These procedures were modified by performing large-scale transfection of 
293 cells in suspension and expanding the transfected cells in suspension culture for production 
of secreted or intracellular proteins. Several initial experiments were performed at the 100- 
milliliter scale cultures to determine optimum conditions, such as number of cells, type of 
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transfecting reagent (FuGENE 6, lipitoid or RO-1538) and the ratio of DNA to transfection 
reagent Based upon pilot experiments, FuGENE 6 was the best transfecting reagent 

Hie kinetics of gene expression was compared to other viral envelope glycoproteins, and 
the data suggest that stable protein expression peaks around 72 to 96 hours post-transfection, 
5 depending upon the gene of interest, and then significantly decreases thereafter. Thus, using the 
optimum conditions, the transfection process was scaled from 100 ml to 4 liters. Hie 4 liter 
culture can be used for rapidly producing 2-10 milligrams of protein antigens. To facilitate 
antigen purification and also maximize the yield and recovery of the purified protein, 
transfection conditions were optimized by using serum-free medium. 

10 Bulk transfection procedure has been used for the expression of truncated and full-length 

Spike antigens. Hie kinetics of expression for truncated form of the spike protein is presented in 
Figure 56A. Expression of the truncated form of Spike protein peaked around 48 hrs and was 
stable until 72 hrs, therefore the cultures were, harvested at 72 hrs post transfection. 

Collected media were concentrated 20X and used for purification of truncated Spike 

15 protein by a very simple purification strategy where the truncated form of the spike was captured 5 
on GNA lectin followed by DEAE and ceramic hydroxyapatite column chromatography. The 
purified protein was analyzed on SDS-PAGE by silver stain (Figure S6B) and also by western 
blot (Figure 56C). Early efforts were able to purify the truncated form of the spike protein with 
>95% purity and approximately 50% recovery. The molecular mass of the truncated form of the 

20 Spike protein is approximately 170-180 kDa. 

Full-length Spike protein was expressed in 293 cells using the bulk transfection strategy. 
The expression data suggest that, like the truncated form, expression peaked around 48 hrs post- 
transfection and remained stable until 72 hrs. However, contrary to the truncated form and as 
expected, full-length protein is not secreted, but rather is retained within the cells, as shown by 

25 the absence of any signal in western blots of cell culture supernatants. The full-length form of the 
protein was purified from Triton X-100 detergent-extracted cells. Full-length Spike protein was 
then captured on GNA lectin, followed by hydroxyapatite and SP chromatography. Hie 
calculated molecular mass of full-length spike protein is approximately 600 kDa, which is close 
to the theoretical mass for the trimer. 

30 EXAMPLE 13: SARS virus seed cultures 

A S ARS-CoV reference seed virus propagated only in certified Vero cells will be used for 
the generation of the Master and Working Virus Seeds under GMP. A clinical specimen from the 
respiratory tract of a patient infected by the SARS-CoV is inoculated onto documented VERO 
cells, with certified culture media. Culture media containing the virus are harvested at 4 days 

35 post-infection and designated Passage 1 (PI). A second round of virus propagation is again 
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performed in certified VERO cells with certified media, by inoculation of 1 ml perT-75 flask of 
100 times diluted PI virus. Culture supernatant was harvested at 3 days post-infection and stored 
at -80°C as a P2 reference stock virus, without plaque purification. 

Cell banks of Vero cells for further production of SARS-CoV are prepared from specific 

5 cell subsets that have not been used since the emergence of transmissible spongiform 

enephalopathies (e.g. since 1980). A research cell bank of these cells has been prepared using 
specified New Zealand-origin fetal bovine serum. From this research cell bank, a Master Cell 
Bank (MCB) is made under GMP conditions and using only specified and well-controlled media 
and supplements. The cell bank will is tested for absence of adventitious agents according to 

10 applicable US, EU, and international guidelines (see Points To Consider "Characterization of 
cell lines used to produce biologicals", FDA/CBER 7/1993; ICH Q5D Draft 6 "Cell substrates", 
Oct.23, 1996; CPMP/ICH/294/95 "Note for Guidance on Quality of Biotechnological Products: 
Derivation and Characterisation of Cell Substrates Used for Production of 
Biotechnological/Biological Products. (Step 4, 16. July 97); WHO final draft "Requirements for 

15 use of animal cells as in vitro substrates for the production of biologicals" 7.3.1997). 
Tumorigenicity and identity testing is also required for this cell bank. 

The reference virus is plaque-purified and expanded in certified Vero cells in the absence , 
of FCS in order to generate Master arid Working Seeds. Another option to help ensure parity and 
facilitate the assessment of safety of the Master Seed is to subject the S ARS-CoV to pelleting 

20 and resuspension in PBS. The virus suspension is made up to 60% (w/w) sucrose with crystalline 
sucrose, transferred to a centrifuge tube and overlayed with 50, 40, 30, and 20% (w/w) sucrose 
solutions in PBS. The gradient is centrifuged for 72h and then fractionated. The virus^containing 
fraction is diluted and the virions re-pelleted by ultracentrifugation. RNA from the virus pellet is 
isolated and transfected into certified Vero cells whereby the "infectious" positive-strand RNA 

15 will lead to the production of infectious virus, which can be plaque-purified and expanded to 
generate alternative Master and Working Seeds from purified virus RNA. 

Viral seeds are tested for the absence of adventitious agents (see e.g. 21 CFR Revised as of 
April 1, 1994, § 630.35 Test for safety) and for identity, using a highly-specific neutralizing 
antiserum prepared from an independent source. Safety testing of viral seeds for vaccine 

JO purposes is done routinely by service laboratories. Broad-spectrum PCR testing can be used as 
an addition and/or alternative for testing. 

EXAMPLE 14: Scale-up of virus production and inactivation 

A protocol for the production, inactivation, and purification of inactivated S ARS-CoV with 
sufficient structural integrity to elicit protective neutralizing antibody responses in animal 
$5 models involves: Vero.cells are infected with virus at an M.O.L of 0.01 in the absence of FCS 
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and antibiotics; culture medium is collected, cleared by centrifugation, and inactivated with BPL, 
followed by confirmatory testing for complete inactivation; the inactivated material is filtered, 
subjected to MCS-column purification, and further purified by sucrose gradient centrifugation. 
Several modifications and improvements can be developed when adapting this basic 
5 protocol to a larger scale for commercial use. Firstly, the cell culture and infection process can be 
adapted to roller bottles, as an intermediate step to allow rapid production for preliminary trials 
within existing BSL 3+ facilities. Full commercial production will typically use a fermentation 
process in a closed system, but a roller bottle system can be achieved more rapidly. The roller 
bottles do offer a true suspension culture system for Vero cells, which gives various technical 

10 and safety advantages over microcarrier cultures. Suspension cultures can be grown to any 

desired fermentation scale without interfering with the closed system between cell passages, as 
no trypsinization is required. 

To scale up the infection process in roller bottles to 30-50 liters per batch, the optimum 
M.OJL and harvesting periods for selected media and culture conditions should first be 

15 determined. For the larger scale, methods for harvesting and handling larger volumes of highly 
infectious material safely should be used, and so cell separation via centrifugation should be 
replaced by a method such as filtration through single-use filter cartridges. 

The MCS-chromatography and the gradient purification steps described above can readily 
be scaled to a batch volume of up to 50 liters. For larger volumes, however, and for increased 

20 purity, ultrafiltration and sterile filtration steps will be used. Nuclease treatment to remove host 
cell DNA will also be included. 

EXAMPLE 15: Large scale analytical methods 

Analytical methods for the SARS coronavirus include virus titration methods, 
immunological and physico-chemical methods to quantitate and characterize the purified antigen 
25 (ELBA, PAGE, western blots using specific antisera against purified whole virus, etc.). Other 
analytical tests include: fast yield testing via asymmetric field flow separation and laser particle 
detection and counting; Western blot using specific antisera against individual viral proteins; and 
tests for residual host cell DNA. 

Residual DNA testing is generally done by hybridization e.g. using a limit test. Such 
30 testing is performed according to methods already established and validated for other cell lines. 
As an alternative, the Threshold™ method may be used. 

For producing specific antibodies, recombinant protein expression of all the ORFs from the 
structural and non-structural gene regions of the SARS-CoV is used. The ORFs can be cloned 
and expressed in E.coli and, if necessary, also in eukaryotic vectors such as baculovirus. This can 
35 provide sufficient amounts of purified soluble protein to immunize mice and rabbits to produce 
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polyclonal and monoclonal antibodies against SARS proteins and to set up specific ELIS A 
assays. Different expression vectors can be tested to maximize the yield of recombinant protein 
in a soluble form e.g. different vectors, one containing sequences coding for six N-terminal 
histidine residues and another containing a Glutathione-S-transferase protein fused to die 

5 C-terminus of the SARS protein. Hie recombinant proteins can be purified by single step column 
chromatography on either Nickel chelating Sepharose or Glutathione-Sepharose 4B resin. These 
procedures are very rapid and generally produce protein of 60-90% purity, which is suitable for 
raising specific antisera (Pizza et al. (2000) Science 287:1816-20). I^ve mice and two rabbits for 
each recombinant protein can be immunized SC with 20 and 50 /xg recombinant protein, 

10 respectively, given in IFA as adjuvant, at day 0, 14 and 28. Sera are collected at day 7, 21 and 35 
to assess specific titers before euthanasia of the animals for collection of blood and removal of 
spleens. 

For the detection of impurities (e.g. Vero cell derived proteins) in the vaccine preparation, 
rabbit serum reactive against Vero-derived proteins can be used Such antisera are obtained by 

15 imm nnirin g rabbits with at least 10/ig of Vero cell lysate with CFA/IPA. The sera can be 

verified for reactivity against Vero-derived proteins in western blots. For more specific antisera 1 
against specific relevant cell-derived proteins that tend to be co-purified with the virus, mock- 
infected cell culture harvest that have undergone the purification process can be prepared and 
used for immunizing rabbits. 

20 Methods to determine neutralization titers of sera from immunized animals and humans 

can be developed, without the constraints of using infectious S ARS-CoV in a BSI^3+ 
laboratory. One such strategy will be to use recombinant antigens, particularly Spike protein or 
Spike-derived epitopes, and to develop ELIS A assays for measuring antibodies against the target 
protein. Suitable epitopes allow a correlation to be established between the BUS A values and 

25 virus neutralization assay values. This approach provides a faster and more efficient (higher- 
throughput) comparison of specific and protective antibody titers. This ELIS A test is also the 
ideal tool to monitor specific antibodies in safety trials, where several hundred animal sera must 
be tested 

Another strategy is to combine structural elements from both the pathogenic SARS-CoV 
30 and the non-pathogenic coronavirus mouse hepatitis virus (MHV) to construct chimeric virus- 
like particles (VLPs) that can be labeled. The assay is based on fusion between octadecyl 
rhodamine (R18>labeled VLPs and cells (Hoekstra et al. (1984) Biochemistry 23:5675-81). Hie 
method relies on the relief of fluorescence self-quenching of R18 incorporated into VLPs upon 
fusion with cellular membranes. Coronavirus VLPs have been shown to mimic native virions 
35 with respect to their appearance in the electron microscope (EM) and their biological activities. 
As they do not contain viral RNA, however, then they cannot cause a productive infection 
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(Vennema et aL (1996) EMBO J 15:2020-2028). Hie VLP system can be used for the mouse 
hepatitis virus (MHV) strain A59 (MHV-A59) )Godeke et aL (20(K)) 7 Virol 74:1566-15) 
containing a chimeric S protein. The protein chimera, consisting of the ectodomain of the SARS- 
CoV and the transmembrane and endodomain (64 C-terminal amino acid residues) from the 

5 MHV spike protein, can be co-expressed with the MHV M (membrane) and E (envelope) protein 
in OST-7 cells )Godeke et aL). VLPs secreted in the supernatant are harvested, purified and 
labeled with octadecyl rhodamine (Rl 8) (Hoekstra et al). A constant amount of VLPs is 
incubated with a serial dilution of sera at 37°C for 1 hour in a 96-well plate. Subsequently, cells 
expressing the receptor for the S ARS-CoV, the angiotensin-converting enzyme 2 (ACE2) (li et 

10 aL (2003) Nature 426:450-54) is be added and the extent of fusion can be measured with a 
fluorescence spectrophotometer. 

A final strategy to monitor the ability of sera to inhibit cell-cell fusion interactions between 
cells expressing the SARS-CoV S protein and a human cell line expressing the angiotensin- 
converting enzyme 2 (ACE2), a functional receptor for SARS-CoV (Li et aL). This reporter 

15 gene-based assay uses the fluorescent shift (green to blue) of the fluorogenic substrate CCF2/AM 
(AM=acetoxymethyl) upon cleavage by ^-lactamase (Bla) as read-out for cell-cell fusion 
(Ztokarnik et aL (1998) Science 279:84-88). For this assay, a BHK-derived cell line, stably 
expressing Bla and the SARS-CoV S protein is generated. In addition, a human cell line 
expressing ACE2 on its surface is used. BHK cells, expressing the S protein on their surface and 

10 Bla in their cytosol are incubated with serial dilutions of the sera to be tested for lh at 37°C. The 
cell line expressing the ACE2 is loaded with 1/iM CCF2/AM for 1 h at 22°C, washed twice with 
PBS, and co-cultivated with the BHK cells. In case of cell-cell fusion, Bla cleaves the substrate, 
resulting in a green blue shift with excitation at 409 nm. Inhibition of fusion by sera thus 
provides a detectable change. 

15 EXAMPLE 16: Stabilisation of inactivated SARS-CoV 

Although the purified inactivated SARS-CoV vaccine is capable of inducing potent 
neutralizing antibody responses in animals, it is relatively instable and can benefit from 
formulation to increase stability for an acceptable period of time. Suitable formulation changes 
include the use of various buffer systems, pH ranges, stabilizing excipients (e.g. sugars and 

10 sugar alcohols, amino acids, etc.) etc.. Stability testing can be conducted in real-time at normal 
storage temperatures, or can be conducted in an accelerated manner by using elevated 
temperatures. Vaccine stability can thus be increased to approximately one year or longer. 
Lyophilized vaccine formulation can also be used to extend shelf-life, possibly with further 
additives for stability during lyophilisation. 
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EXAMPLE 17: Dose and schedule optimization for inactivated virus 

Animal models of SARS-CoV infection have been reported, including mice, ferrets and 
macaques. As mentioned in example 4 above, mice immunized with the BPL-S ARS-CoV 
vaccine achieve neutralizing antibody titers in the range of 1 : 100 - 1 : 1000, similar to levels 

5 found in convalescent patients, and are 100% protected from infection with a challenge virus. 
While the mouse challenge model is limited only to infection but not disease, ferrets and 
macaques are useful models of the human SARS disease. Two to four days after inoculation with 
SARS-CoV, both ferrets and macaques have been found to shed infectious SARS-CoV particles 
from the throat, nose and pharynx, as demonstrated by RT-PCR and/or virus isolation on Vero 

10 cells. At approximately the same time, the infected animals became lethargic, show respiratory 
distress and eventually die. Histologically, SARS-CoV infection in these animals associates with 
pulmonary lesions of different severity, similar to those found in biopsied lung tissue and 
autopsy material from SARS patients. With the availability of these models, preclinical -studies 
with vaccines can be performed initially in mice for immunogenicity readouts, while efficacy of 

LS optimal doses and schedules can be assessed in the ferret and macaque models. 

- Initial studies in mice are used to determine the optimal dose and schedule required to 
elicit the highest levels of neutralizing antibody, with titers at least in the range of 1/100 - 
1/1000. In parallel to the assessment of neutralizing activity, other features of the humoral 
immune response and cellular immune responses can be investigated. In particular sera from 

2J0 immunized mice can be assessed for the isotype (IgGl vs. IgG2a) of the Spike-specific antibody 
response. Also, the frequencies of splenic CD4+ T cells producing IFNF-7 and JL-4 in response to • 
BPL-SARS-CoV particles will be assessed by ELISPOT and ELBA. These experiments can 
provide insight into the quality of the T cell response helping the priming of a protective 
antibody response. 

15 Increasing vaccine doses can be tested (e.g. from 5 to 20 /ig of BPLrS ARS-CoV alone or 

mixed with an equal volume of MF59-citrate), administered SC to anesthetized mice in 100/il of 
inoculum. Groups of BALB/c mice, 10 per treatment, are immunized, with priming at day 0 and 
boosting at days 14 and 28. Secondary endpoints compare the kinetics of neutralizing vs. Spike- 
specific antibody titers and assess the Thl/Th2 profile of the specific immune response, and so 

30 neutralizing and Spike-specific antibody titers are assessed at days 7, 21, 35, and at 2, 3, 4, and 5 
months after priming. The IgG2a and IgGl titers of Spike-specific antibodies are determined at 
days 21, 35, and at 2, 3, 4, and 5 months after priming. Proliferation and IFN-7 and IL-4 
production by splenic T cells against recombinant Spike protein from SARS-CoV are assessed at 
day 42, and at the end of the 5th month. Peripheral blood is collected at days 7, 21, 35, and at 2, 

35 3, 4, and 5 months after priming. Spleen cells will be obtained at day 42 and at the end of the 5th 
month. Neutralizing and Spike-specific antibody titers and isotypes are determined by inhibition 
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of infection of Vero cells and by EJUS A, respectively. Proliferation of splenic cells is determined 
by 3 [H] -thymidine uptake. Frequencies of splenic DFN-7 and IL4 producing CD4 + T 
lymphocytes is determined by ELISPOT and FACS analysis. 

Based on mouse results, the BPL-S ARS-CoV vaccine can be tested in ferrets for the 

5 induction of protective neutralizing antibody titers. Ferrets are immunized according to a similar 
schedule as the mice and at the dose that elicits the highest neutralizing antibody titers in mice at 
day 35 after the second boost Three groups of ferrets, 6 per treatment, are immunized with BPLr 
SARS-CoV, alone or mixed with an equal volume of MF59-citrate, administered SC to 
anesthetized animals in 200/il of inoculum. The animals are primed at day 0 and boosted at days 

10 14 and 28. Peripheral blood is collected at days 7, 21, and 35. Neutralizing and Spike-specific 
antibodies titers are determined by inhibition of SARS-CoV infection of Veto cells and by 
ELLS A, respectively. Each group of ferrets is used to assess efficacy of the BPL-S ARS-CoV in 
protecting vaccinated animals from infection and/or disease. Anesthetized animals are 
challenged intratracheally, two weeks after the last boost, with 10 6 median tissue culture 

15 infectious dose units (TCID50) of the SARS-CoV CDC strain. Infection by SARS-CoV can be 
assessed by taking nasal, pharyngeal and rectal swabs from animals for 20 days after challenge > 
(Martina et al. supra). The presence of S ARSrCoV in sample materials can be assessed by RT- 
PCR and infection assay of Vero cells. Animals can be monitored for clinical signs of S ARS 
disease by assessing sleeping time, temperature, respiratory symptoms, diarrhea, body weight 

10 and survival. Protection can be determined by the magnitude and duration of virus shedding, by 
• i duration and severity of disease symptoms, and by percentage of surviving animals. The 

formulation eliciting the highest neutralizing antibody titers at day 35 can then be tested against a 
two-fold higher dose of BPL-SARS-CoV given in the same formulation in the same regimen. 
Additional studies can evaluate immunogenicity and efficacy of the candidate vaccine in 

15 non-human primates. Three groups of adult cynomolgus macaques, 4 per treatment, are 
immunized with BPL-S ARS-CoV, alone or mixed with an equal volume of MF59-citrate, 
administered SC to anesthetized animals in 500/il of inoculum. The BPL-S ARS-CoV vaccine 
can be tested at the dose eliciting the highest neutralizing antibody titers in ferrets at day 35 after 
the second boost The animals are primed at day 0 and boosted at 3 and 6 weeks. Peripheral 

JO blood is collected at weeks 1, 4, and 7. A secondary endpoint is to assess the Thl/Th2 profile of 
, the specific immune response. Neutralizing and Spike-specific antibody titers and frequencies of 
peripheral blood CD4+ T cells producing IFN-7 and IL-4 in response to the recombinant SARS- 
CoV Spike protein is thus assessed at weeks 1, 4, and 7. Neutralizing and Spike-specific 
antibody titers can be determined by inhibition of SARS-CoV infection of Vero cells and by 

\5 ELISA, respectively. Intracellular cytokine staining and FACS analysis will be used to quantify 
IFN-y- and IL-4-producing CD4 + T cells. The macaques can also be used to assess efficacy of 
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the BPL-SARS-CoV in protecting vaccinated animals from infection and/or disease. 
Anesthetized macaques can be challenged two weeks after the last boost with 10 6 median tissue 
culture infectious dose unit (TCID50) of the SARS-CoV CDC strain in a 5 ml volume. A few 
drops of the virus can also be administered on each of the conjunctiva, 0.5 ml in the nose and the 

5 remainder in the trachea. Infection by S ARS-CoV can be assessed by taking nasal, pharyngeal, 
and rectal swabs, and feces from animals for 20 days after challenge (Fouchier et a/. (20030 
Nature 423:240). Hie presence of SARS-CoV in sample materials can be assessed by RT-PCR 
and infection assay of Vero cells. Animals can also be monitored for clinical signs of SARS 
disease by assessing sleeping time, temperature, respiratory symptoms, diarrhea, body weight 

10 and survival. Protection can be determined by the magnitude and duration of virus shedding, by 
duration and severity of disease symptoms, and by percentage of surviving animals. 



Mice 



Group 


Treatment 


Dose/Route 


Sampling interval 


Number of mice 


1-3- 


BPL-SARS-CoV 


20, 10, 5 /ig/SC 


7,21, 35 d; 
2,3,4,5m; 


10 per dose level 


4-6 


BPL-SARS-CoV 


20, 10, 5 ug/SC 


42 d 


10 per dose level 


7-9 


BPL-SARS-CoV MF59 


20, 10, 5 ug/SC 


7, 21, 35 d; 
2,3,4,5 m; 


10 per dose level 


10-12 


BPL-SARS-CoV MF59 


20, 10, 5 ug/SC 


42 d 


10 per dose level 


13 


MF59 


NA/SC 


7, 21, 35 d; 
2,3,4,5 m; 


10 + 10 (sacrificed at 
42 d anid end 5 m) 


14 


Saline 


NA/SC 


7, 21, 35 d; 
2,3,4,5 m; 


10 + 10 (sacrificed at 
42 d and end 5 m) 



Ferrets 



Group 


Treatment 


Route 


Sampling interval 


No. of ferrets 


1 


BPL-SARS-CoV 


SC 


7, 21, 35 d 


6 


2 


BPL-SARS-CoV-MF59 


SC 


7, 21, 35 d 


6 


3 


Saline 


SC 


7, 21, 35 d 


6 



Macaques 



Group 


Treatment 


Route 


Sampling interval 


No. of macaques 


1 


BPL-SARS-CoV 


SC 


1,4, 7 w 


4 


2 


BPL-SARS-CoV - 
MF59 


SC 


1,4, 7 w 


4 


3 


Saline 


sc 


1,4, 7 w 


4 



EXAMPLE 18: Human T cell responses 

As a prelude to initiation of clinical studies in humans, the reactivity of peripheral blood T 
20 lymphocytes from healthy donors with different HLA haplotypes can be assessed using the in 
vitro priming technique (Abrignani et aL (1990) Proc Natl Acad SciUSA 87:6136-40). The aim 
of this study is to have a first indication of the immune-dominant T cell epitopes in SARS-CoV 
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proteins. Briefly PBMCs from 20 healthy donors with different HLA haplotypes will be cultured 
in medium containing 5% autologous sennn, in the presence of different concentration of SARS- 
BPLrCoV particles in the range from 0 J to 20 jig/ml. The expression of activation markers will 
be assessed after 24 and 48 hours. Frequencies of IFN-y- and DL-4- producing T lymphocytes 

5 will be assessed after 12h and after 15 days in culture, in the presence of 100 U/ml recombinant 
human EL-2. Activated and cytokines producing CD4 T lymphocytes will be sorted and 
eventually cloned as single cells using FACS technologies. The CD4+ T cell repertoire from 
human subjects with different HLA will be assessed by proliferation assays of the CD4+ T cell 
lines and clones against autologous EBV-transformed cell lines loaded with 15-mer overlapping 

10 peptides from the most relevant structural and non structural protein of the SARS-CoV. 

When moving to actual human trials, safety and immune responses will be evaluated in 
healthy adults following intramuscular immunization with escalating doses of the BPL- • 
inactivated SARS-CoV vaccine, with MF59 adjuvant being included or omitted depending on 
preclinical data. Three/four immunizations will be given at 0, 1, 6 months in the first cohort, and 

15 atO, 1,2, 6 months and 0,2, 6 weeks in the second and third cohorts respectively. Hie trial will . 
be observer blind and placebo controlled. Subjects will be randomized into each dose level. 
Immune response parameters to be measured will include serum neutralizing antibodies, FT IS A 
antibodies and peripheral blood IFN-gamma-producing CD4+ T cells by intracellular cytokine 
staining. 



Group 


Antigen 


Administration 


No. treated 


No. subjects 


Sampling interval 




dose (jig) 


schedule 


subjects 


with placebo 




Al 


10 


0,1,6 months 


18 


6 


0, 1, 2, 6, 7 mos 


A2 


20 


0,1,6 months 


18 


6 


0, 1, 2, 6, 7 mos 


Bl 


10 


0,1 ,2,6 months 


18 


12 


0, 1, 2, 6, 7 mos 


B2 


20 


0,1,2,6 months 


18 


12 


0, 1, 2, 6, 7 mos 


CI 


10 


0,2,6 weeks 


18 


12 


0,2,6,10,30wks 


C2 


20 


0,2,6 weeks 


18 


12 


0,2,6, 10,30 wks 



EXAMPLE 19: Selection ofCHO cell lines for Spike protein expression 

Methods for the derivation of Chinese Hamster Ovary (CHO) cell lines that stably express 
viral envelope glycoproteins that are conformationally intact, appropriately glycosylated and 
efficiently bind neutralizing antibodies are well established for HIV and HCV (Srivastava et al. 

15 (2002) / Virol 76:2835^7; Srivastava et al. (2003) / Virol 77:1 1244-259; Heile et aL (2000) / 
Virol 74:6885-92). The same techniques can be applied to SARS-CoV, to generate two different 
stable CHOK-1 cell lines producing either full-length or truncated SARS Spike proteins. The 
Spike proteins can be expressed using the constructs described herein, but without the hexa-EBs 
tags. These proteins can compared for their ability to produce neutralizing antibodies in 

JO immunized animals as well as for their expression levels in CHOK-1 cells. 
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A pCMV3 vector expressing Spike can be used for the derivation of stable CHOK-1 cell 
lines, containing the CMV enhancer/promoter, ampicillin resistance, and a fused DHFR and 
attenuated neomycin gene for selection purposes. Stable cell lines can produced using the 
neomycin selection system in CHOK-1 cells. Clones can be sequenced to verify the integrity of 
5 the insert, and transient transfections can be performed using Trans-LTl polyamine transfection 
reagent (PanVera Corp., Madison, WI) to assess the expression level and also the integrity of the 
expressed protein by BUS A and western blot analysis. 

Initial CHO cells will be selected to be free from TSE/BSE contaminants and risks 
according to relevant regulatory standards. To construct cell lines, procedures involve 
10 transfection, primary screening with selective medium, followed by subcloning to assure purity 
of cell lines. Cell supernatants can be assayed using an antigen capture BUS A to quantify 
expression levels at all stages of selection and amplification. For full-length Spike expression, 
methanol fixed cells can be screened for internal expression by immunofluorescent staining 
using a rabbit anti-SARS antibody. Successive measurements at the T75-flask stage of 
15 expansion canbe employed to assure stability of expression levels. The molecular mass and 
integrity of the expressed proteins can be checked by PAGE both under native and reducing and 
denaturing conditions, followed by iinmunoprobing. 

The pCMV3 vectors expressing SARS-CoV Spike proteins in either full-length or 
truncated forms can be introduced into CHOK-1 cells using the Trans-LT-1 reagent and non- 
20 selective media. 24-48 hours post-transfection, depending on cell density, cells are split at a 1:5 . 
ratio and the medium can be changed to selective media containing neomycin at 500/ig/ml. Any . 
bovine serum used in these procedures will be from TSE-free sources that meet regulatory 
standards. Teri to fourteen days later, -individual colonies can be picked and transferred to 96 
well plates and cultured in complete non-selective medium. When approximately 80% of the 
25 wells are confluent, 24 hour supernatants can be screened by Spike capture EOS A. For initial 
expression of full length Spike protein, cells can be fixed with methanol and screened by 
immunofluorescent staining using a rabbit anti-SARS antibody. After low-expressing cell lines 
have been eliminated and there are fewer than 20-30 cell lines, capture EUSA and western blots 
can then be used to determine the expression level after cell lysis. A portion of each cell line can 
30 be pelleted, weighed and lysed in 1% Triton lysis buffer for determination of expression levels. 
Three to four clones producing the highest levels of spike protein in correct structure and 
conformation can be expanded to three-liter bioreactors and adapted to low serum suspension 
culture conditions for scale-up. 

The antigen capture EOS A assay for the SARS spike protein can be performed using 96 
35 well flat-bottom plates coated with 250ng per well of purified immunoglobulin obtained from 
rabbit sera that were immunized with inactivated SARS virus. Supernatant or lysate samples are 
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added and incubated for 2 hours at 37°C. Bound antigen is reacted against pooled S ARS +ve 
serum or high affinity monoclonal antibody either human or mouse against SARS spike protein 
and detected using appropriate species-specific peroxidase-conjugated second antibody. The 
plates are developed using TMB substrate (Pierce, Rockford, IL), read at a wavelength of 
5 450nm, and the concentration of protein per ml sample is derived from a standard curve (OD vs. 
protein concentration) based on serial dilutions of a known concentration of recombinant spike 
protein. 

The immunoprobing analysis will also be performed following the standard methods 
described by Srivastava et aL (2002) supra. Briefly, 10-20/xl of the sample is analyzed on 4-20% 

10 SDS PAGE under non-reducing/denaturing conditions with mild heating. Hie proteins are then 
transferred onto nitrocellulose membranes and reacted against polyclonal anti-Spike rabbit 
serum, followed by anti-rabbit Ig conjugated to Alexa 688 (Molecular Probes, Oregon). The 
blots are scanned using an infrared imaging system. 

Hie highest expressing candidate cell lines will be screened for Spike protein expression 

15 and stability in small-scale (3 liter) perfusion bioieactors. The candidate clones will be farther .. 
evaluated for level of expression as well as integrity of expressed protein, and subsequently 
tested for expression stability in the absence of selection. The selected clones also will be tested 
for maintenance of the DNA sequence integrity of the integrated SARS spike protein gene. To 
quickly monitor the expression levels in small flasks and in the three liter evaluation cultures, a 

20 lectin-based process (Gluvanthus Nivalis lectin) has been developed to isolate SARS spike 

protein to a degree of purity that allows semi-quantitation and characterization of the protein in . 
CHO supernatant. Full-length Spike protein will be obtained from Triton X-100 detergent 
extracted cells and then captured on GNA lectin, followed by hydroxyapatite and SP 
chromatograph. Eluted protein is then characterized by: (1) polyacrylamide gel electrophoresis 

25 (PAGE) and Coomassie staining, (2) immunoprobing with anti-SARS rabbit sera, (3) structural 
characterization using size exclusion chromatography (SEC), as well as mass spec analysis using 
MALDI-TOR 

Productivity from the CHO cell line expressing SARS spike protein should be at least 2 
mg/L and for full-length Spike protein will be 3mg/100gm of cells, at steady-state cell density. 
30 Yield from one 45 day, 2.5-liter bioreactor will be -1000 mg crude protein. 

EXAMPLE 20: Purification of spike protein for human vaccines 

To purify SARS spike protein for the purpose of producing GMP grade material for human 
use, the following basic process is used, with all steps being performed at 2-8°C: the starting 
material, concentrated CHO cell culture supernatant (20-30X) is thawed and filtered through a 
35 0.45/im membrane; this material is heavily contaminated proteins from culture, as well as DNA; 
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the first purification step is affinity chromatography using Gluvanthus Nivalis (GNA), a lectin 
that preferentially recognizes terminal mannose containing carbohydrates; glycosylated proteins, 
including SARS spike protein are captured and non-glycosylated proteins, as well as DNA, do 
not bind to this column; the GNA column is followed by two chromatographic steps operated in 

5 the flow through mode; the anion exchanger, DEAE, and ceramic hydroxyapatite (cHAP); 
DEAE binds some contaminating supernatant proteins and DNA, whereas cHAP binds any 
contaminating serum proteins; full-length Spike protein is purified from the cell pellet; the cells 
are lysed with Triton X-100 and full-length Spike protein is then captured on GNA lectin, 
followed by hydroxyapatite and SP chromatography. 

10 The purified SARS spike can be further treated to remove adventitious viruses: viral 

inactivation at pH 3.5 for 1 hour, the sample is then concentrated and diafiltered into a buffer at 
pH 4 and finally captured the purified protein using SP resin; the spike protein binds to this resin 
and many viruses flow through. 

The spike protein is eluted, concentrated and diafiltered into formulation buffer. This 

15 formulated bulk product is then filtered through a DV50 viral removal membrane followed by 
filtration through a 0.2 /nn membrane. The formulated bulk is filled into suitable containers e.g. 
into 3.0 ml vials, in a class 100 laminar flow hood. 

In process testing at each step of the purification includes protein concentration, endotoxin 
(LAL), bioburden, and recovery. 

20 Prior to human administration, a test for potency will evaluate the specific ability of the 

vaccine in an invitro or in vivo test to effect a given response. The in vivo immunogenicity will 
be determined by dosing groups of 10 mice with various doses of the protein antigen. Sera will 
be analyzed for the presence of IgG antibodies using an ELISA. The criterion for passing will be 
based upon the number of vaccine treated animals that are seropositive compared to a reference 

25 standard. Other tests include General Safety, sterility, purity, identity of the vaccine (using an 
ELIS A specific for Spike protein), and quantity & protein concentration (UV spectrophotometric 
absorbance procedure based on the molar absorbance of the aromatic amino acids). 

Stability testing will be performed on the bulk drug substance and on the final container 
product Bulk product will be evaluated at temperatures of -60°C (recommended storage 

30 condition), 25 ±2 °C and 40 ±2 °C protected from light, at time points of 0, 3, 6, 9, 12 months. 
Final container product will be tested at temperatures of -60 °C, and inverted at 5 ±3 °C, 25 ±2 
°C, and 40 ±2 °C at time points of 0, 3, 6, 9, 12 months. Stability-indicating assays may include 
appearance, pH, protein content, SDS-PAGE, size exclusion HPLC, and container/closure 
integrity, performed on single samples of bulk and triplicate vials of final container material. 

35 The protein purified in this way can be evaluated in mice, rabbits and ferrets as described 

in, and based on the results of, examples 4, 5, 8 and 9 above. 
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Initial experiments will be performed in mice to determine optimal dose and schedule of 
the GMP Spike protein required to elicit the highest levels of neutralizing antibody, with titers at 
least in the range of 1/100 - 1/1000. Spike protein will be tested in the range from 5 to 40 /ig, 
alone or mixed with an equal volume of MF59-citrate, to anesthetized mice in 100/il of 

5 inoculum. Groups of BALB/c mice, 10 per treatment, will be immunized Hie animals will be 
primed at day 0 and boosted at days 14 and 28. Secondary endpoints will be to compare the 
kinetics of neutralizing vs. Spike-specific antibody titers and to assess the Thl/Th2 profile of the 
specific immune response. Neutralizing and Spike-specific antibody titers will be assessed at 
days 7, 21 , and 35 and at 2, 3, 4, and 5 months after priming; the IgG2a and IgGl titers of Spike- 

L0 specific antibodies will be determined at days 21 and 35, and at 2, 3, 4, and 5 months after 

priming; proliferation and IFN-7 and ILr4 production by splenic T cell against the recombinant 
Spike protein from SARS-CoV will be assessed at day 42 and at the end of the 5th month. 
Peripheral blood will be collected at days 7, 21, and 35 and at 2, 3, 4, and 5 months after 
priming; spleen cells at day 42 and at the end of the 5th month. Neutralizing and Spike-specific 

15 antibody titers and isotypes will be determined by inhibition of SARS-CoV infection of Vero 
cells and by ELB A, respectively. Proliferation of splenic cells will be determined by 3 [HQ- 
thymidine uptake. Frequencies of splenic IFN-y and IL-4 producing CD4+ T lymphocytes, will 
be determined by ELISPOT and FACS analysis. 

Next, the optimal dosing and schedule for recombinant Spike vaccine will be determined in 

20 ferrets. Based on the mouse results, the Spike vaccine eliciting the highest antibody neutralizing 
titers will be tested against a two-fold higher dose of recombinant Spike protein given in the 
same formulation. Three groups of ferrets, 6 per treatment, will be immunized SC under 
anesthesia with 200/il of inoculum. The animals will be primed at day 0 and boosted at days 14 
and 28. Peripheral blood will be collected at days 7, 21, and 35. Neutralizing and Spike-specific 

25 antibodies titers will be determined by inhibition of SARS-CoV infection of Vero cells and by 
EOS A, respectively. Similar to the previous ferret studies, each group of animals will be used to 
assess efficacy of the vaccine in protecting immunized animals from infection and/or disease. 

Immunogenicity and efficacy of the candidate vaccine also will be evaluated in nonhuman 
primates. Three groups of adult cynomolgus macaques, 4 per treatment, will be immunized with 

}0 recombinant SARS-CoV Spike protein, alone or mixed with an equal volume of MF59-citrate, 
administered SC to anesthetized animals in 500 /xl of inoculum. The Spike protein vaccine will 
be tested at the dose eliciting the highest neutralizing antibody titers in ferrets at day 35. Hie 
animals will be primed at day 0 and boosted at 3 and 6 weeks. Peripheral blood will be collected 
at weeks 1, 4, and 7. A secondary endpoint will be to assess the Thl/Th2 profile of the specific 

35 immune response, as described above (neutralizing and Spike-specific antibody titers, 
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frequencies of peripheral blood CD4+ T cells producing IFN-y and IL-4 in response to the 

recombinant Spike protein, assessed at at weeks 1, 4, and 7). 

Finally, human phase I, placebo-controlled, dose-escalation, safety/ immunogenicity trials 

will be performed for the IM recombinant SARS vaccine with MF59 adjuvant Hie trial will 
5 evaluate safety and immune responses in healthy adults following immunization with escalating 

doses of SARS recombinant vaccine with MF59 adjuvant, administered intramuscularly. 

Three/four immunizations will be given at 0, 1, 6 months. The trial will be observer blind and 

placebo controlled- Subjects will be randomized into each dose level. Immune response 

parameters to be measured include serum neutralizing antibodies, ELISA antibodies and 
10 peripheral blood IFN-7-pioducing CD4+ T cells by intracellular cytokine staining: 



Group 


Vaccine 
Antigen 
dose (ug) 


Administration 
schedule 


No. of 

treated 

subjects 


No. of subjects 
with placebo 
(MF59) 


Sampling interval 


Al 


50 


0,1,6 months 


18 


6 


0, 1, 2, 6, 7 months 


A2 


100 


0,1,6 months 


18 


6 


0, 1,2, 6, 7 months 



EXAMPLE 21: Comparison of inactivated virus and purified Spike protein 

Immunogenicity and efficacy of the inactivated virus vaccine and the purified Spike 
protein can be compared in non-human primates. Three groups of adult cynomolgus macaques, 4 

15 for treatment, will be immunized with recombinant SARS-CoV Spike protein from CHO cell 
lines or with BPL-S ARS-COV, given in the dose and formulation eliciting the highest 
neutralizing antibody titers in previous immunogenicity challenge experiments, administered SC 
to anesthetized animals in 500/xl of inoculum. The animals will be primed at day 0 and boosted 
at 3 and 6 weeks. Peripheral blood will be collected at weeks 1, 4, 7. A secondary endpoint will 

20 be to assess the Thl/Th2 profile of the specific immune response, as described above. 



Group 


Treatment 


Dose/Route 


Sampling interval 


No. of macaques 


1 


Rec-Spike protein 
+ or - MF59 


Y/tg/SC 


1,4, 7 w 


4 


2 


BPL-SARS-CoV 
+ or-MF59 


Yjig/SC 


1,4, 7 w 


4 


3 


Saline 


NA/SC 


1,4, 7 w 


4 



EXAMPLE 22: Expression in yeast 

Yeast is a useful and inexpensive eukaryotic expression system. Yeast-expressed proteins 
are used in recombinant hepatitis B virus vaccines, and recombinant SARS antigens may also be 
25 expressed in yeast for vaccine purposes. Yeast-expression is also convenient for the production 
of antigens for preparing monoclonal and polyclonal antitobodies, or for use in serological 
assays. 
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The nucleocapsid protein (N) and two different versions of the spike glycoprotein (S) from 
SARS coronavirus FRA strain (AY310120) were cloned for expression in S.cerevisiae: 
SARS N: aa 1 - 422 (coordinates 28120-29388 of AY310120 strain) - Rg.65 
SARS spike: aa 14 - 1195 (transmembrane domain and cytoplasmic tail deleted) - Kg.66 
5 SARS spike: aa 14 - 662 (SI domain) 

To make the SI construct, a XhoI-NotI fragment of approximately 3733bp encoding the full- 
length spike glycoprotein was the starting point PCR was used to amplify the full-length gene 
in two pieces: Xbal-Blnl of 2440bp and Blnl-Sall of 1306bp. These fragments were subcloned 
into commercial vectors (Novagen): pT7Blue2 Xbal-Blnl (5' end of spike glycoprotein) and 
10 pT7Blue2 Blnl-Sall (3' end of spike glycoprotein; Figure 58), respectively. The following 
primers were used in the subsequent PCR reactions: Spk-1 (5 1 ) SEQ ID NO: 9785; Spk-2 (50 
SEQ ID NO: 9786; Spk-3 (5 1 ) SEQ ID NO: 9787; Spk-4 (5 1 ) SEQ ID NO: 9788. 

* E. coli HB101 competent cells were transformed with the PCR ligation product and plated * 
on Luria agar plates, containing 100/ig/ml ampicillin. The desired clones were identified using 
15 miniscreen DNA analysis. After sequence verification and plasmid amplification of the desired ' 
subclones, it was desirable to eliminate the internal Sail site present in the Xbal-Blnl portion of 
the spike sequence in order to facilitate future cloning into the yeast expression vector (BamHI- 
SalT). Therefore, we prepared a CelH-Mfel vector from the pT7Blue2 Xbal-Blnl (5 1 end Spike) 
subclone to eliminated a 143bp sequence containing the Sail site. Kinased oligos DS1-6 (SEQ 
20 ID NOS: 9789-9794) were then ligated into the Celll-Mfel vector to replace the 143bp that were ■ 
removed to mutate the Sail site (no aa changes), creating pT7Blue2.XbaI-BlnIAsal. 

The 5* Xbal-Blnl (from pT7Blue2 JCbal-Blnl ASal) and the 3' Blnl-Sall (from pT7Blue2 
Blnl-Sall) spike glycoprotein inserts were gel-purified and ligated them into the p893-l Xbal- 
Sall vector (a vector derived from plitmus 38 (New England Biolabs) with the alpha-factor 
25 leader sequence cloned into the BamHI -Sail sites of the MCS). The resulting full-length SARS 
Spike coding sequence was named p893-l.SARS Spike 1255 #9 (Figure 58). 

Exoli HB101 competent cells were transformed with the oligo replacement ligation 
product and plated on Luria agar plates, containing 100/xg/ml ampicillin. The desired clones 
were identified using miniscreen DNA analysis. After sequence verification of the positive 
30 clones, pT7Blue2 Xba-Bln ASal was chosen for use as a template for PCR reactions to amplify 
the Spike SI 1967 bp Xba-Sal fragment The fragment was then subcloned into the p893-l Xba- 
Sal vector, sequence verified, and named it p893-l. Spike SI #11 (Figure 59). 

In order to clone into the S.cerevisiae expression vector, pBS24.1, the 5* end of the SI 
sequence had to be modified from Xbal to HindHI to allow ligation with the 3 1 Hindin end of the 
35 ADH2/GAPDH Bamffl-HIndin promoter fragment From pT7Blue2 Xba-BlnASal (described 
above) an Agel-Sall 1943bp fragment was gel-purified. This fragment was ligated along with a 
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synthetic pair of HindE-Agel 30bp kinased oligos (Sl-l+Sl-2 creating the necessary 5' EBndlll 
site) into the pSP72 HOndOI-Sall commercial subcloning vector (named pSP72.SARS Spike SI 
#2; Figure 59). Sl-1 had SEQ ID NO: 9795 and Sl-2 has SEQ ID NO: 9796. 

After sequence verification of the positive clone from miniscreen DNA analysis, the 
5 Hindlll-Sall fragment was gel purified Hie 1365 bp Bamffl-Hindin ADH2/GAPDH promoter 
fragment was ligated along with the 1973 bp HmdHI-Sall SI fragment into the pBS24.1 BamHI- 
Sall vector creating the genetically engineered pdSARS Spike SI #2 expression plasmid (Figure 
60). 

S.cerevisiae strain AD3 was transformed with pcLSARS Spike SI #2 and single 
10 transformants were checked for expression after depletion of glucose in the medium. Hie 
recombinant protein was expressed at high levels in yeast, as detected by Coomassie blue 
staining. In particular, yeast cells were transformed with the SARS SI expression plasmid using 
the Invitrogen S.c. EasyComp™ Transformation Kit Expression in shown in Figure 57; 

To express Spike 1195 protein, which does not contain the trans-membrane (TM) region or 
15 cytoplasmic tail that are present in the full-length SARS construct, the following series of genetic 
manipulations was performed: 

. ' From pT7Blue2 Blnl-Sall #1 1 (described above) a Blnl-Dral 1056bp fragment was gel 
purified This fragment was ligated with a synthetic pair of 68bp Dral-Sall kinased oligos 
. (DRS1+2; SEQ ID NOS: 9797 & 9798) into a pT7Blue2 Blnl-Sall vector (Figure 61). Kcoli 
20 HB 101 competent cells were transformed with the oligo replacement ligation product and plated - 
on Luria agar plates, containing 100/xg/ml ampicillin. The desired clones were identified using 
miniscreen DNA analysis. After sequence confirmation the clone was named pT7Blue2 Blnl- 
Sal Spike 1 195 #7. The 1 126bp Blnl-Sall fragment encoding the 3' end of the Spike 1 195 was 
gel purified (Rg.61). 

25 In order to generate the Xbal-Sall Spike 1195 fragment, the 3109bp Xbal-Pcil fragment 

was isolated from the p893-l.SARS Spike 1255 #9 (described above) and a 457bp Pcil-Sall 
fragment from pT7Blue2.S ARS Spike 1 195 #7 (described above). The two fragments were 
cloned into the p893-l Xbal-Sall vector, creating the p893-l.SARS Spike 1195 #34 plasmid 
(Figure 62). 

30 To clone SARS Spike 1195 into the pBS24.1 Saccharomyces cerevisisae expression 

vector, it was necessary to modify the 5' end of the SARS Spike 1 195 from Xbal to Hindm, as 
done for the Spike SI expression clone described above. To begin, the 2416bp Agel-Blnl 
fragment was isolated from p893-LSARS Spike 1195 #34. Tlris fragment was ligated with the 
synthetic HindlE-Agel 30bp oligos (described above to generate the SI protein for expression in 

35 S.cerevisiae) into the pT7Blue2 HindlE-Blnl vector. B. coli HB101 competent cells were 
transformed with the oligo replacement ligation product and plated on Luria agar plates, 
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containing 100/ig/ml ampicillin. The desired clones were identified using miniscreen DNA 
analysis. After sequence verification of the positive clone and plasmid amplification of 
pT7Blue2.SARS 1195 5' HindlE-Blnl #10 (Figure 63), we isolated a 402bp HindTlT-Ncol 
fragment and the 2044bp Ncol-Blnl fragment (Hgure 63). It was necessary for the EBndm-Blnl 

5 isolation to be done in two steps to avoid cloning issues related to the internal HindTTT site 
located at nucleotide number 1319 of the spike 1195 protein. 

To assemble the BamHI-Saltexpression cassette of Spike 1195 into the pBS24.1 vector 
KcoliBBlOl competent cells were transformed with the the Bamffi-Hindni (ADH2/GAPDH 
promoter), HmdlH-NcoI 402bp fragment, Ncol-Blnl 2044bp and the Blnl-Sall 1 126bp fragments 

10 into the pBS24.1 BamHI-Sall vector. Hie samples were plated on Luria agar plates, containing 
100/ig/ml ampicillin. The desired clone was identified using miniscreen DNA analysis, thus 
creating the genetically engineered pASARS Spike 1195 #10 (Hgure 64). 

, S.cerevisiae strain AD3 was transformed with pdSARS Spike 1195 #10 and single 
transformants were checked for expression after depletion of glucose in the medium. The 

15 recombinant protein was detected by Coomassie blue staining. In particular, yeast cells were 
transformed with the SARS 1 195 expression plasmid using the Invitrogen S.c. EasyCompT™ 
Transformation Kit 

EXAMPLE 23: Expression in mammalian cell lines 

cDNA fragments containing the S protein ORF of 1255 amino acids were amplified by 

20 RT-PCR from SARS viral RNA (Frankfurt isolate) grown in Vera cells. The amplified PCR 
fragments were cloned into pBlueScript vector, sequenced, and consensus spike sequence was 
assembled to create a full-length SARS spike clone, pBSnSh. In vitro transcription of pBSnSh 
followed by translation in a rabbit reticulocyte lysate resulted in the production of single 
polypeptide with an estimated molecular mass of -140 kDa. 

15 The insert of this plasmid was recloned via Xhol and Not I into a mammlian expression 

vector pCMVUI (Srivastava et aL (2003) J. Virol 77:11244-11259) to create a construct, nSh 
(Fig. 74A). A PCR fragment containing a spike protein of 1 195 amino acid, which was deleted 
for transmembrane (TM) domain and cystein-rich cytoplasmic tail (Cy) was amplified and 
cloned pCMVUI vector to generate the contstruct nShATC (Figure.74B). Both constructs were 

50 tagged with six histidine residues at the C-terminus in order to aid in their characterization. The 
Xho I/Not I fragment without a histidine tag also was subcloned into the alphavirus replicon 
vector backbone pVCRchim2.1 for use in the production of an alphavirus replicon particle 
chimera that expresses S protein. Production and characterization of the replication defective 
alphavirus vector particles was performed essentially as described previously (Perri et aL (2003) 
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/. Virol 77:10394-10403; Polo et aL (1999) PNAS USA. 96:4598-4603). The resultant alphavirus 
vector particles wane named as VEE/SIN. 

COS7 cells and BHK-21 cells were maintained in Dulbecco's modified Eagle's medium 
supplemented with 10% fetal bovine serum at 37°C and 5% C0 2 in air. COS7 cells were 

5 transfected with expression plasmids (nSh, nShATC) using a transfection kit (TransIt-COS, 
Minis) following the manufacturer's protocol. The cells were washed once with ice-cold PBS 
and lysed with lx Lysis buffer (20mM MOPS, lOmM NaCl, 1.5mM MgCl 2 , and 1% Triton X- 
100) containing complete mini protease inhibitor (Roche). After a 30-min incubation on ice, the 
debris was cleared by centrifugation. The cleared lysate was either purified or used directly in 

10 western blotting. 

To purify secreted spike proteins, medium from transfected cells was collected and 
subjected to centrifugation at 12,000 rpm for 10 min to remove cellular debris. The cleared 
medium was applied to a ConA-agarose column (V ector Lab). The column was washed 
extensively with 20mM sodium phosphate buffer, and then the bound proteins were eluted with 

15 1M methyl a-D-mannopyranoside (MMP), 1M NaCl in 20mM sodium phosphate buffer: 
Column fractions containing SARS-CoV spike proteins were applied to MagneHis Protein 
purification system (Promega) following the protocol suggested by the manufacturer. 

For western blot analysis, proteins were separated by 4-20% SDS-PAGE and then 
transferred electrophoretically to nitrocellulose membrane (Invitrogen). Membrane was blocked 

£0 in blocking buffer (5% skim milk and 0.1% Tween 20 in PBS) and incubated with indicated 
antibody at room temperature for 1 hr, washed and probed with horseradish peroxidase (HRP)- 
conjugated secondary antibody (Biosource) followed by chemiluminescence (ECL system, 
Amersham) and exposed by X-ray films. The antibodies used wore a mouse monoclonal anti- 
histidine antibody (anti-His # tag Mab, Novagen), a rabbit polyclonlal antipeptide antibody against 

25 S ARS-CoV spike proten (SmPab, Abgent), or rabbit anti-SARS sera (2BE) obtained by 
immunization of rabbits with purified SARS-CoV virion. The latter has a cell culture 
neutralizing titer of 1/2,500. Unless stated otherwise, antibody was used at 1/1,000 for anti- 
histidine antibody and SmPab and 1/10,000 for anti-SARS rabbit sera. 

Some spike proteins were treated with Peptide-N glycosidase F (PNGase F). Cell lysates 

30 were diluted in 0.5% SDS and 1% P-mercaptoethanol and denatured at 100°C for 10 min. After 
2-fold dilution with 1% NEMO in 50mM sodium phosphate (pH 7.5), the samples were treated 
with PNGase F (NEB) at 37°C for 1 hr. Enzyme-treated samples were analyzed by 4-12% SDS- 
PAGE in reducing condition. For a partial digestion with the PNGase, the cell lysates were 
diluted with 50mM sodium phosphate (pH 6.0) containing 0.75% Triton-X and treated with 

35 PNGase F (Calbiochem) at 37°C for 3 hr. Enzyme-treated samples were analyzed by 4-20% 
SDS-PAGE in nonreducing condition. 
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Western blots of cells 48-hours after transfection are shown in Hgure 75. Hie S protein 
was detected in cell lysates as a doublet with estimated molecular weight of -170 -180 fcDa , 
when the lysate was boiled and analysed under reducing SDS-PAGE conditions (Rg. 75 A, lane 
3). This doublet appears to result from differential glycosylation of one polypeptide product 

5 since pre-treatment of the cell lysate with PNGase F reduced the doublet to a single species of 
-140 kDa (Fig. 75A, lane 4). This is the expected size predicted from the aa sequence for a full- 
length, intact polypeptide product This experiment indicates that the full length SARS-CoV S is 
expressed in mammalian cells as a single, uncleaved polypeptide, but in two differentially 
glycosylated forms, gpl70 and gpl80 respectively. Unlike the two S glycoforms encoded by the 

10 full-length sequence, none of which were secreted, the SA protein product was detected both in 
cell lysates (Fig. 75A, lane 5) as well as in the cell culture medium (Rg. 75B, lane3) as a single 
species of ~ 160 kDa. 

In order to further characterize the intracellular processing of the S protein, and as 
described above, BHK21 cells were infected with defective alphavirus particles expressing the 

15 full-length S. At 6 hr post infection with a MOI of 5, infected cells were pulse labeled for 1 hr 
with I^[ 35 S] methionine/cysteine and chased for 2 or 4 hours. The [ 35 S]-labeled S protein was 
immuno-precipitated using the rabbit antiserum raised against inactivated, purified virus and 
then digested with Endo H. The Endo H treatment involved dilution with a sample buffer (50mM 
sodium phosphate, 0.1% SDS, 50 mM DTT, pH 6.0) and boiling for 5 min. After denaturation, 

50 the samples were further diluted with 0.75% Triton-X 100 and treated with endoglycosidase H 
(Endo H) following manufacturer's protocol (Calbiochem) for 3 hr at 37°C. Enzyme-treated * 
samples were added with gel loading buffer containing 0.1% SDS and DTT and analyzed by 8% 
SDS-PAGE. 

Both digested and undigested proteins were boiled in SDS and analysed by reducing SDS- 
:5 PAGE (Figure 55). After a 1-hr pulse, the S protein was apparent as a single gpl70 component 
that was Endo H sensitive (lanes 1 and 2). After a 2-hr chase, a new species (gpl80) was present 
along with gpl70 in approximately equal proportions (lane3). After a 4-hr chase, the gpl80 
species was the dominant S protein component Qane 5) that was now Endo H resistant (lanes 5 
and 6). This data is consistent with gpl70 being an ER-resident glycoprotein containing high 
0 mannose chains and with gpl80 corresponding with a Golgi-processed glycoprotein containing 
Endo H-resistant complex oligosaccharides. 

The Endo H sensitivity of the C-terminus deleted SA protein purified from cell culture 
media ws also tested. As shown in Figure 76, the S A observed within cell lysates was found to be 
Endo H sensitive (lanes 1 and 2), while the secreted SA in cell culture media was Endo H 
5 resistant (lanes 3 and 4). This result is consistent with this glycoprotein being synthesized in an 
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immature form in the ER prior to transfer to the Golgi where the complex carbohydrate is added 
and the protein then secreted 

As already described, the S protein expressed in COS7 cells was detected as a 
gpl70/gpl80 doublet in western blot analyses of cell lysates that were fully denatured by boiling 
5 in the presence of DTT. However, the majority of S protein was detected as a high molecular 
glycoprotein in the 440-669 kDa range when the same cell lysate was not heat-denatured prior to 
western blot analysis using SDS-PAGE (Rg. 77, lane 1). Hie -500 kDa species was resistant to 
10 mM DTT treatment (lane 3) and not dissociated into the monomelic form unless the lysate 
was first heat-denatured at 100°C (lane 4). In contrast, oligomeric form of a test protein 

10 (Thyroglobulin) of which quaternary structure is held by disulfide-linkage was converted into 
subunit form by the 10 mM DTT treatment These data suggest that the -500 kDa oligomeric 
form of S protein is not disulfi de-linked and is heat labile. To confirm the heat-sensitivity of the 
-500 kDa species of S protein, the heat-denaturation experiment was repeated but without DTT. 
As shown in Figure 78, heat denaturation of -500 kDa protein at 100°C alone was sufficient to 

15 convert it into gpl70/180 monomeric forms (lane 4). Using a 80°C heat-denaturation step, both ; 
the -500 kDa and monomeric forms were detectable in similar proportion (lane 3). 

In order to investigate further whether this -500 kDa species represents an S protein 
oligomer in native conformation, comparative analyses with virion-derived S glycoprotein 
derived from Vero cell cultures was performed. The purified virions were solubilised in 1% SDS 

20 prior to Western blot analyses after SDS PAGE. The presence of the -500 kDa spike protein 
oligomer was confirmed in virion particles (Fig. 79, lane 1). In addition, heat denaturation of 
solubilised virions produced the same oligomer-to-monomer conversion as sera with the full- 
length recombinant S (lanes 23)- The oligomeric nature of virion S was further analysed in a 
cross-linking experiment Aliquots of inactivated virion from sucrose gradient fractions were 

25 treated with 10% SDS at 1% final concentration and diluted 2-fold with 0.2M Triethanolamine- 
HC1 (pH 8, Sigma); Dimethyl suberimidate (DMS; Pierce Chemical Co.) was then added from a 
freshly prepared solution (lOmg/ml in 0.2M Triethanolamine-HCl) at 3.3mg/ml final 
concentration. After 2 hr at room temperature, samples were concentrated with Centricon-30 and 
analyzed by silver staining after electrophoresis on a 4% polyacrylamide gel. Both untreated and 

30 DMS cross-linked virion proteins were heat-denatured, and the heat effect on the maintenance of 
oligomer structure was analysed by SDS-PAGE and silver staining (Figure 80). In the absence of 
cross-linking, heat denaturation resulted in the replacement of the -500 kD spike protein species 
with the monomer species. In contrast, in the cross-linked proteins, the levels of the -500 kD and 
monomer species did not change significantly after heating. These data support the fact that the 

35 -500 kD protein is an oligomer of S monomer proteins that are bound non-covalently. After 
cross-linking and boiling, the -500 kDa species migrated as a somewhat slower diffuse form 
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than the untreated form. This mobility shift is probably due to a structural change resulting from 
boiling. In addition, a minor protein species of -300 kD, which may represent a non-dissociated 
S dimer, could be seen. 

To estimate more precisely the size of the recombinant -500 kDa S species expressed in 
5 COS7 cells, a COS7 cell lysate containing the S protein oligomer was fractionated using size- 
exclusion column chromatography. The major portion of the -500 kDa oligomer co-eluted with a 
572 kDa marker protein. Taken together, these experiments suggest that the -500 kDa S species 
seen in COS7 cell lysates is probably a homotrimer of the S protein monomer. 

The oligomeric status of the S A spike protein was also examined after expression in COS7 

10 cells. As shown in Figure 81, the recombinant SA proteins present in cell lysates were also 

detected in high molecular weight forms of -500 kDa range when the lysate was not heated prior 
to SDS-PAGE and Western blot analysis (lane 1). However, the efficiency of oligomerization by 
intracellular SA protein appears to be much less (<10%) compared to that of full-length S protein 
under the same western analysis conditions. A heat-sensitivity test on this -500 kDa protein 

1 5 showed that the SA oligomer was more heat labile than that of the full-length 5 oligomer, as 
demonstrated by the >90% conversion of all of the -500 kDa species into monomelic Sd forms 
at 80°C (lane 2). Also (Figure 82), the majority of the secreted SA protein was found in 
monomeric form with the -500 kDa species barely detectable (and only detectable when ihe 
protein was loaded in excess for Western analysis) (lane 1). At a temperature above 80°C, all 

20 secreted SA proteins were detected as monomers (lanes 2, 3). 

The -500kDa protein is glycosylated, and the effect of deglycosylation on its antibody 
binding was investigated The recombinant COS7 lysate was treated with PNGase F under non- 
denaturing condition (as described above) and analysed by western blot As shown in Figure 83, 
deglycosylation did not affect the binding of anti-histidine Mab antibody to the treated S 

25 oligomer (lanes 23). However, it compromised the reactivity with the rabbit antisera raised 

against purified virus flane 6). This antiserum binds to virion-derived S in western blot analyses 
only when DTT is omitted from the sample for SDS-PAGE indicating that it recognizes 
primarily a discontinuous, conformational epitope(s). This antisera has also been shown to have 
a high-titer of viral neutralizing antibodies. Its lack of binding to deglycosylated, recombinant S 

30 suggests that the carbohydrate actively contributes to the higher order, native structure of the S 
polypeptide oligomer. 

The difference between the recombinant S and S A protein is the presence or absence of the 
TM-and Cys-rich domains at the C-terminus. This difference predicts that full-length S would be 
found associated with the membrane fraction while Sd would be in the soluble fraction upon 
55 lysis of transfected cells. Therefore, nSh- or nShATC-transfected cells were lysed under 
hypotonic conditions and the soluble cytosol fraction was separated from the insoluble 
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membrane fraction by centrifugation (Figure 48). As shown in Figure 84, the S protein was 
found in the membrane fraction (DF) both as a -500 kDa and 180/170 kDa species (lane 4) but 
was not detectable in the soluble cytosol fraction (AF) (lane 3). However, the truncated SA 
protein was found as a monomelic species (gpl70) in both fractions (lanes 5,6). This indicates 
5 that the C-terminal TM and Cys-rich domains are required for the anchorage of the S protein to 
cell membrane. 

The cellular location of the S and S A proteins in COS7 cells was analyzed by indirect 
immunofluorescence microscopy. At 48 hr post-transfection, cells were directly fixed with 2% 
paraformaldehyde without detergent for cell surface staining or treated with detergent followed 

10 by Cytofix/Cytopenn solution for intracellular staining. Fixed cells were then stained with rabbit 
anti-SARS sera (2BE) and FETC-conjugated antibody. The nSh-transfected cells showed foci of 
S protein indicative of Gdlgi-localisation (Figure 85A), while the nShATC-transfected cells 
displayed a uniform distribution of S A protein throughout the cytoplasm indicative of ER 
localisation (Figure 85B). While the complete S protein was also observed on the surface of 

15 transfected cells in unfixed cells (Figure 85D), the SA was undetectable on the cell surface 
(Figure 85E). These results indicate the role played by the TM-and Cys-rich domains in 
anchoring the S protein to the plasma membrane. Although the TM-region alone is likely 
responsible for membrane anchorage, the potential role played by the Cys-rich region remains to 
be determined 

20 The SARS recombinant full-length S protein is thus an N-linked glycoprotein with an 

estimated molecular weight of 170-180,000 kDa. Deglycosylation with PNGase F resulted in a 
polypeptide of the expected size for the uncleaved, encoded polypeptide (140kDa). Both 
transient and stable expression of the full-length SARS-CoV S gene in a variety of mammalian 
cells, including COS7, 293, BHK21, and Huh7 cell lines, consistently produced a S protein 

25 doublet (gp 170/1 80) as detected in western blot analyses.. Pulse-chase analyses of transfected 
cells demonstrated that the SARS CoV S protein was initially synthesized as an Endo H sensitive 
gpl70 species followed by the gradual appearance of an Endo H resistant gpl80 form, 
presumably as a result of the addition of complex carbohydrate within the Golgi apparatus. 
The recombinant S protein was not secreted into the cell culture medium unless the C- 

30 terminal 60 amino acids containing the TM-region and the Cys-rich tail were deleted. 

The quaternary structure of the full-length recombinant S protein was investigated using 
cross-linking treatment, heat-denaturation, and size fractionation analyses. The results data are 
consistent with the recombinant S protein existing as a homotrimer of ~500kDa. Similar analyses 
of virion-derived S yielded the same results. Such a trimeric structure has been reported for other 

35 enveloped RNA viruses: the hemagglutinin HA of influenza virus, the E1-E2 heterodimer of 
alphaviruses and the G protein of vesicular stomatitis virus. Incubations under reducing 
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conditions indicate that the SARS-CoV S trimeric structure is non-covalently associated, and is 
very stable. S oligomers present in the cell lysate were shown to be resistant to reduction by 10 
mM DTT, detergent treatment with 1% SDS, and heat denaturation at up to 60°C . Incubation at 
a temperature higher than >80°C resulted in the dissociation of the trimeric complex as 

5 evidenced by the decrease in trimer with the concomitant increase in the monomer bands. The 
temperature-induced appearance of the high-mannosylated gpl70 (ER monomer form) as well as 
the complex-glycosylated gpl80 (Golgi monomer form) suggests that trimerization can occur 
before the transport of the monomer spike protein to the medial Golgi apparatus. This is 
consistent with other reports for TGEV, influenza virus HA, and vesicular stomatitis virus G 

10 proteins. With these proteins, trimerization was reported to take place before addition of complex • 
oligosaccharides in the Golgi apparatus. 

. The C-terminally truncated form of S was found in the cell lysate in both oligomeric and 
monomeric forms at a frequency of 10% and 90%, respectively. The truncated protein secreted 
into medium was found fully glysosylated and it was essentially all in monomeric form. We 

15 conclude that the C-terminal 60 amino acids of the S glycoprotein contains a membrane anchor 
region that affects the efficiency of trimerization. In S protein trimerization, it is possible that the 
C-terminal region is required to initiate the event and the triple-stranded coiled coil structures in 
the S2 stalk domain provide further stabilizing force as seen in HA oligomer of influenza virus. 

EXAMPLE 24: CHO cells for Spike protein expression 
JO CHO cell lines that stably express either the full-length or truncated S ARS-CoV spike 

proteins were prepared. Several stably transfected CHO cell lines were obtained, and Figure 73 
shows western blot data from a panel of representative clones. 

EXAMPLE 25: Expression in E.coti 

All SARS-CoV ORFs (Figure 17, Table 10) were cloned in the pET vector and expressed 
15 as C-terminal His-Tag fusion proteins in Ecoli. The proteins smaller than 16KD were also 
expressed as N-tenninal GST (Glutathione S-transferase) fusion proteins using pGEX vector. 

Nspl and Nsp2, the two SARS-CoV proteins with proteolytic activity, were not expressed 
as full length proteins due to toxicity in Exoli. The respective genes were instead cloned in 
different portions in order to separate* the catalytic residues (Cys833/ His994 for Nspl; 
30 His41/Cysl45 for Nsp2) in the resulting recombinant proteins: Nspl A from nucleotides 2719- 
5214 of AY310120; NsplB from nucleotides 5218-7371; NsplC from nucleotide 7372-9984; 
Nsp2A from nucleotide 9985-10416; Nsp2B from nucleotide 10476-10902. 

Nsp9 (SEQ ID NO: 9775) was divided into two portions: Nsp9A from nucleotide 
13371- 14756; Nsp9B from nucleotide 14757-16166. 
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Matrix (M), ORF3 and 0RF7 contain respectively three, two and one transmembrane 
domains. These proteins were expressed as deletion proteins excluding the first 100 amino acids 
(M and ORF3) or the first 18 amino acids (ORF7) that include the hydrophobic regions. 
Hie cloned sequences are shown in Table 26. 

5 A two-step strategy was used to amplify the cloned sequences. In the first step, 

amplification of DNA fragments containing more than one gene or single gene used sequenced 
cDNA as template. Eleven cDNA sequences were amplified: (1) a fragment, named amplCl, 
including genes coding for protein E, protein M, orf 7-8-9-10; (2) a fragment, named amplC2, 
including genes coding for orf 3-4; (3) a fragment, named amplCS, including genes coding for 

10 proteins Nspl2 and Nspl3; (4) Nspllgene; (5) P28 and P65 genes; (6) NsplB and NsplC genes 
portion; (7) a fragment, named amplC9, including genes coding for proteins Nsp2 and Nsp3; (8) 
a fragment, named amplNsp4-7, including genes coding for proteins Nsp4, Nsp5, Nsp6, Nsp7 
and for amplification of Nsp9A gene portion; (9) Nsp 9B gene portion and NsplO gene; (10) a ■ 
fragment, named amplCO, including genes coding for proteins Orf 1 1, Nucleocapsid (N) and 

15 Orf 12; (11) NsplA gene portion. The primers used in this first step are given in Table 27: 

. In the second step, amplification of single genes was performed using DNA fragments 
from the first amplification step as templates. The primers are shown in Table 28. 

Of the proteins where expression was seen, it was either in inclusion bodies (insoluble) or 
in a soluble form. Purification proceeded on appropriate material. Table 29 shows the molecular 

JO weight of the expressed fragments of SARS-CoV ORFs, whether they were cloned (+ or -), 
whether the cloned fragment was seen to be expressed (+ or -) and the form of protein which 
was chosen for purification. 

Where a protein was a soluble His-tagged product, a single colony was streaked and grown 
overnight at 37°C on a LB/Amp (100 /ig/ml) agar plate. An isolated colony from this plate was 

25 inoculated into 20ml of LB/Amp (100 /xg/ml) liquid medium and grown overnight at 37°C with 
shaking. The overnight culture was diluted 1:30 into 1.0 L LB/Amp (100 /ig/ml) liquid medium 
and allowed to grow at the optimal temperature (30 or 37°Q until the OD550nm reached 0.6- 
0.8. Expression of recombinant protein was induced by addition of EPTG (final concentration 1.0 
mM) and the culture incubated for a further 3 hours. Bacteria were harvested by centrifugation at 

30 8000 x g for 15 min at 4°C. The bacterial pellet was resuspended in 10 ml of cold buffer A (300 
mM NaCl, 50 mM phosphate buffer, 10 mM imidazole, pH 8.0). Cells were disrupted by 
sonication (or French Press) on ice four times for 30 sec at 40W using a Branson sonifier 450 
and centrifuged at 13 OOOxg for 30 min at 4°C. Supematants were mixed with 150/xl Ni 2+ -resin 
(previously equilibrated with buffer A) and incubated at room temperature with gentle agitation 

35 for 30 min. The resin was Chelating Sepharose Fast Flow (Pharmacia), prepared according to the 
manufacturer's protocol. The batch-wise preparation was centrifuged at 700 x g for 5 min at 4°C 
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and the supernatant discarded The resin was washed twice (batch-wise) with 10ml buffer A for 
10 min, resuspended in 1.0 ml buffer A and loaded onto a disposable column. The resin 
continued to be washed with buffer A at 4°C until the OD280nm of the flow-through reached 
0.02-0.01. The resin was further washed with cold buffer B(300 mM NaCl, 50 mM phosphate 

5 buffer, 20 mM imidazole, pH 8.0) until the the OD280nm of the flow-through readied 0.02-0.01. 
The His-fusion protein was eluted by addition of 700/il of cold elution buffo: C (300 mM NaCl, 
50mM phosphate buffer, 250 mM imidazole, pH 8.0) and fractions collected until the OD280nm 
indicated all the recombinant protein was obtained 20/il aliquots of each elution fraction were 
analyzed by SDS-PAGE. Protein concentrations were estimated using die Bradford assay. 

10 Where a protein was seen as an insoluble product, the inclusion bodies were purified as 

follows: homogenize cells (5 g wet weight) in 25 ml 0.1M Tris HC1 pH 7, ImM EDTA, at 4°C 
using an ultraturrax (10000 rpm); add 1.5mg lysozyme per gram cells; mix shortly with an . 
ultraturrax and incubate at 4°C for 30'; use sonication or high-pressure homogenization to 
disrupt the cells; to digest DNA, add MgCh to a final concentration of 3mM and DNase to a 

L5 final concentration of lOug/ml and incubate 30' at 25°C. add 0.5 vol of 60mM EDTA; 6% Triton 
x-100, 1.5M NaCl pH 7.0 to the solution, and incubate for 30' at 4°C; centrifugation at 31000 g 
for 10' at 4°C; re-suspend pellet in 40ml of 0; 1M Tris HQ pH 7.0, 20 mM EDTA using 
ultraturrax; centrifugation at 31000 g for 10' a 4°C; store the IB pellet at - 20°C. 

The results of expression are shown in Figures 86 to 105. Examples of purity and yield are 

10 given in Table 30. 

EXAMPLE 26: Retention of critical epitope on truncated Spike antigen 

A human monoclonal antibody having neutralizing activity was tested in an HI ISA assay 
for reactivity with the purified truncated Spike protein. Briefly, BUS A plates were coated with 
truncated form of the spike protein at a concentration of 1 jig/ml (lQOjd/ well) by incubating the 

>5 plates overnight at 4°C. The plates were washed, non-specific binding sites were blocked and 
then different dilutions of the antibody were added and plates were incubated for 1 hour at room 
temperature. At the end of incubation, the plates were washed and bound antibody was detected 
by using anti-human IgG conjugated to horse radish peroxidase (HRP) and an appropriate 
substrate. The optical density of each well was recorded at 405 nm'using an ELBA reader. The 

}0 data are shown in Figure 69 and clearly demonstrate that the neutralizing epitope recognized by 
the mAb is preserved and exposed on the recombinant truncated Spike protein. 

EXAMPLE 27: Different Spike vaccines 

Purified truncated spike protein was used to immunize mice and the level of binding 
antibodies induced against the truncated spike protein was determined by EI ISA assay. Briefly a 
J5 group of 10 mice were immunized with 3/ig of truncated spike protein adjuvanted in MF59 at 0, 

-325- 



WO 2004/092360 PCT/DS2004/011710 

4 and 8 weeks intervals. Sera samples were collected from these animals and assayed for 
antibodies induced by truncated spike protein in an ELK A assay. An additional group of 8 mice 
was immunized with 75 fig of DNA encoding the truncated form of the spike protein on PLG 
particles at 0, 4 and 13 weeks intervals, the sera were collected and analyzed as above for anti- 

5 spike antibodies as above 

The profile of binding antibodies induced in each group was plotted as geometric mean 
titer (GMT). Compared to a plasmid DNA vaccine expressing truncated spike antigen and 
delivered using a PLG microparticle formulation, the purified truncated spike protein was 
significantly more potent for inducing strong antibody responses. Further comparison with the 

LO antibody responses induced by inactivated BPL-SARS-CoV (already shown protective) in the 
same mouse strain revealed that the magnitude of antibody responses induced by purified 
truncated spike protein and the inactivated virus vaccine are in the same range (Figure 70). 

The neutralization potential of antibodies induced by the recombinant truncated spike 
protein, or plasmid DNA expressing the same spike antigen, were also evaluated. The GMT 

.5 values obtained in both groups are shown in Figure 71. From these data, it appears that the 
purified protein is significantly more effective at inducing neutralizing antibody responses 
against the SARS-CoV spike. Furthermore, the neutralization titers typically induced by the 
purified truncated spike protein are comparable to neutralization titers induced by an inactivated 
SARS-CoV vaccine. 

0 Figure 72 shows a comparison of antibody binding levels (ELKS A, X-axis) with 

neutralization titers (Y-axis). In general there is a very good correlation between the binding and 
neutralizing antibodies. The bottom-left grouping shows ratios 2 weeks post-3rd immunization 
with the DNA vaccine; the top-right grouping shows ratios 2 weeks post-2nd immu nization with 
the protein vaccine. Both forms of vaccine show a consistent correlation. 

5 In further experiments, the ability of a DNA vaccine to invoke an immune response in 

mice was studied. Mice were immunized with pCMV-nSdTC plasmid, either free or with PLG 
microparticles. Serum from the mice was then used as the staining antibody against cultured 293 
cells that had been transfected with spike, either full-length or truncated The cells were 
centrifuged prior to testing and the pellet was lysed The antibody was tested against the culture 

3 supernatant and against the cell lysate. As shown in Figure 1 12, the mouse serum was able to 
detect spike protein in the lysate of cells that expressed full-length spike and in the supernatant 
of cells that expressed the truncated spike protein. Results were comparable to the staining seen 
when using rabbit serum that had been obtained after immunization with whole killed virus. 
Thus anti-spike antibodies can be induced by the use of DNA vaccination. 
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EXAMPLE 28: Expression cassettes in pCMV 

The sequence of plasmid pCMVKm2 is given as SEQ ID NO: 9923. Genes encoding the 
spike protein either in full-length form (pCMVKm2 SARS Spike nS; SEQ ID NO: 9921) or in its 
ATC form (pCMVKm2 SARS Spike nSATC; SEQ ID NO: 9922) were inserted into this basic 
5 vector. 

Mice were immunized with these vectors, and with similar vectors encoding the N, M or E 
proteins. Vectors encoding the same proteins but with optimized codon usage were also 
prepared. Codons were optimized for efficient human expression starting from the ERA 
sequence (GenBank: AY310120). The optimized sequences are: N (SEQ ID NO: 9924); M (SEQ 

10 ID NO: 9925); E (SEQ ID NO: 9926). 

After administration, expression of proteins could be detected by immunofluorescence in 
all cases. For example, Figure 106 shows immunofluorescence (using anti-SARS rabbit serum) 
results after administration of the vector encoding optimsed N antigen, revealing high level 
expression. Mice receiving the control vector alone showed no fluorescence. 

15 Figure 107 compares immunofluorescence (using Abgent anti-M antibody) of the native M 

sequence (107 A) or the codon-op timsed M sequence (107B). Similarly, Figure 108 compares 
immunofluorescence (using Abgent anti-E antibody) of the native E sequence (108A) or the 
codon-optimsed E sequence (108B). 

Four groups of mice (8 mice per group) were immunized with: (1) SARS nS Spike, nSdTC 

>0 truncated Spike, and N proteins; (2) pCMV-SARS-nSdTC: DNA+DNA-PLG at weeks 0,4 and 
13 wks; (3) CMV~nS: DNA+DNA-PLG+VEE/SIN Rep at 0, 4 and 9 wks; (4) VEE/SIN Rep- 
SARS-nS three times at 0, 4 and 13 wks. Sera from all groups recognized SARS nS and nSdTC 
proteins, and also showed virus binding and neutralization activity. 

EXAMPLE 29: Spike protein cleavage 
15 To investigate the effect of proteolytic cleavage on SARS-CoV Spike protein, it was 

expressed in various forms in Ecoli, including: (1) full-length S1-S2; (2) SI alone; (3) HR1 
heptad; and (4) HR2 heptad. The expressed proteins were used to raise immune rabbit sera, 
which were then used for visualizing western blots of Vero cells, either infected or not infected 
withSARS-CoV. 

\0 Figure 109 shows a western blot using a 1:10000 dilution of antibody raised against either 

the SI domain or the uncleaved S1-S2 domains. Figure 110 shows a western blot using a 
1:10000 dilution of antibody raised against each of the four proteins. The difference in antigen 
reactivity is readily apparent 

Figure 111 shows similar data. Each serum was tested against four lanes, with those gour 

15 lanes being from left to right (a) serum at 1:500 dilution, SARS-CoV-infected cells; (b) serum at 
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1:500 dilution, non-infected cells; (c) serum at 1:2500 dilution, SARS-CoV-infected cells; (d) 
serum at 1:2500 dilution, non-infected cells. Again, the difference in antigen reactivity is readily 
apparent 

Figures 109-111 show that the Spike protein exists in various forms in infected Vero cells, 
5 with sizes of approx. 75kDa, 90kDa, 180kDa and >250kDa* Hie Spike protein is cleaved (at 
least partially) either intracellulary or after release of the particles. 

If enzymatic cleavage of the mouse hepatitis coronaviius spike protein is inhibited then 
cell-cell fusion (syncytia formation) is also inhibited, but virus-cell fusion is not (de Haan et aL 
(2004) / Virol). Syncytia are observed in vivo in the lungs of SARS-infected patients, but are not 
10 seen in Vero cell cultures of the SARS-CoV. Inhibition of Spike protein cleavage may thus be 
used to prevent syncytia formation and related pathology, even though viral infectivity may not 
be blocked. ; 

Example 30: Purification of SARS protease 

Cells were grown at 37°C to mid-log phase and induced with 0.2% I^arabinose. Cells were 

15 harvested by centrifugation, and the cells resiispended in lysis buffer (LB) containing 20 mM 
Tris pH 7.5, 500 mM NaCl, 5% glycerol V/V, 0.05% Triton X-100, 5 mM 0ME, 5 mM 
imidazole, and complete protease inhibitors (-)EDTA. Benzonase was added to a final 1 
concentration of 50U/ml of lysate. Cells were then lysed using two passes through a pre-chilled 
microfluidizer. The lysate was clarified by high speed centrifugation at 44,000 x g. Clarified 

20 lysate was applied to a prepared Pharmacia chelating FF column charged with nickel sulfate. 
After application of the lysate the column was washed with 5 column volumes of LB, followed 
by 5 column volumes of LB supplemented with 45 mM imidazole. The column was then eluted 
using LB supplemented with 250 mM imidazole. Purity of the isolated SARS protease was 50%. 
Fractions containing protease were pooled, adjusted to 5 mM EDTA, and then applied to a 

15 Superdex 200 gel filtration column equilibrated in 20 mM Tris pH 7.5, 150 mM Nad, 5% V/V 
glycerol, 0.05 % Triton X-100, and 5 mM DTT. Purity of the isolated SARS protease was 70%. 
Again, fractions containing the protease were pooled, and then stored at -80°C until used 
Activity assay, mass spectrometry and western blot analysis were used to positively identify the 
protein (FIG 133). All steps were carried out with pre-chilled buffers, and kept at 4°C for as 

JO much of the preparation as possible. 

Western of SARS Protease Purification Fractions 

Protocol: Briefly, protein concentration was based on Absorbance at 280 nm, and coomassie 
stained gel estimates of purity. Protein was run on a 4-20% gradient gel, and transferred to 
nitrocellulose. The blot was then blocked with 3% BSA, probed with Mouse IgG anti-pentaEfis, 
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and then probed with a secondary antibody to Mouse IgG conjugated with HRP. Hie blot was 
visualized using an ECL kit (Pharmacia Biotech). Hie results are shown in Figure 133 where A 
is the sizing column pool loaded at 50, 100 and 200 ng of target protein and B is the immobilized 
metal affinity column pool loaded at 50, 100 and 200 ng of target protein. 

5 

Example 31: Continuous Fluorescence Resonance Energy Transfer (FRET) Enzyme Assay 
Hie peptide containing EDANS, the fluorescence donor, and DABCYL, the fluorescence 
quencher (DABCYL-VNSTLQ VSGLRK-EDANS) was synthesized by Syn. Pep. (Dublin, CA). 
Hie peptide contains the cleavage site Gln-Ser in the middle. Meyers, G. et aL Handbook of 

10 Proteolytic Enzymes and Barrett, A et aL Academic Press, London, 1998, 726-728. The 
proteolytic activity of SARS protease was followed kinetically by measuring the level of 
formation of cleaved product that contains the fluorescence donor, SGLRK-EDANS using the 
Hitachi fluorometer (F-4500 EL Spec.) set at 340 nm excitation and 490 nm emission wave 
length. 5 pL of 5 mM peptide stock in DMSO solution was added to the reaction mixture, 

15 containing 295 pi of standard buffer (75 mM Tris-Hcl, 25 mM NaOAc, 25 mM Bis-Tris, 25 mM 
glycine, 5 mM EDTA, and 1 mM EDTA, pH 7.4) and 100 ul of buffer or 100 ul of 3.6 uM 
protease stock solution. The kinetic curve was followed for 6 minutes (the reaction was linear 
with R2 value of 0.998 (FIG 134)). The formation of fluorescence (proteolytic reaction) is likely 
enzyme dependent, as concentration of enzyme was tripled three times as much fluorescence was 

20 formed in the 6 minutes time frame. 

It will be understood that the invention has been described by way of example only and 
modifications may be made whilst remaining within the scope and spirit of the invention. 

25 
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Hepatitis C Virus Ns5b Compositions And Methods Of Use Thereof 


10/14/1999 


6323180 


Bepatitis C Inhibitor Tri-Peptides 


11727/2001 


6143715 


Bepatitis C Inhibitor Peptide Analogues 


11/7/2000 


6329379 


Hepatitis C Inhibitor Tri-Peptides 


12/11/2001 


6329417 


HepatitisCInhibitorTri-Pepitides 


12/11/2001 
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6410531 


Hepatitis C Inhibitor Iri-Peptides 


6725/2002 


6420380 


Hepatitis C Inhibitor Iri-Peptides 


7/16/2002 


6448281 


Viral Polymerase Inhibitors 


J/10/2002 


6479508 


Viral Polymerase Inhibitors 


11/12/2002 


6534523 


Hepatitis C Inhibitor lYi-Peptides 


3/18/2003 


WOOO/09543 


Hepatitis C Inhibitor Tri-Peptides 


2/24/2000 


WOOO/09558 


Hepatitis C Inhibitor Peptides 


2/24/2000 


WOOQ/59929 


Macrocyclic Peptides Active Against Hie Hepatitis C Vans 


10/12/2000 


WO02/04425 


Viral Polymerase Inhibitors 


1/17/2002 


WO02/70739 


Hcv Polymerase Inhibitor Assay 


9/12/2002 


WO03/07945 


Viral Polymerase Inhibitors 


1/30/2003 


WO03/10140 


Viral Polymerase Inhibitors 


2/672003 


WO03/10141 


Viral Polymerase Inhibitors 


2/6/2003 


WO99/07734 


Hepatitis C Inhibitor Peptide Analogues 


2/18/1999 


WO01/16379 


Hepatitis C Virus Replication Inhibitors 


3/8/2001 


WO02/07761 


Inhibiting Hepatitis C Virus Processing And Replication 


1/31/2002 


WO02/57287 


Nucleoside Derivatives As Inhibitors Of Rna-Dependent Rna Viral Polymerase 


7/25/2002 


WO02/57425 


Nucleoside Derivatives As Inhibitors Of Rna-Dependent Rna Viral Polymerase 


7/25/2002 


WO02/70651 


Viral Reporter Particles 


?/12/2002i 


WO03/20222 
PCT/US2003/ 
041493 


Dioxolane And Oxathiolane Derivatives As Inhibitors Of Rna-Dependent Rna Viral Polymerase 
Ihiosemicarbazones as Anti-Virals and Immunopotentiators ' - 


3A3/2003 j 
Dl/10/2003 



Table 3 ; US Patents and published international patent applications relating to inhalation 
technology for the delivery of antiviral compounds of the invention. 



Publication 
Number 


Title 


Publication 
Date 


5740794 


Apparatus and methods for dispersing dry powder medicaments 


4/21/1998 


5775320 


Method and device for delivering aerosolized medicaments 


7/7/1998 


5785049 


Method and apparatus for dispersion of dry powder medicaments 


7/28/199S 


5814607 


Pulmonary delivery of active fragments of parathyroid hormone 


9/29/1998 


5826633 


Powder filling systems, apparatus and methods 


10/27/1998 


5458135 


Method and device for delivering aerosolized medicaments 


10/17/1995 


5607915 


Pulmonary delivery of active fragments of paramyloid hormone 


3/4/1997 


5654007 


Methods and Systran for processing dispersible fine powders 


8/5/1997 


5922354 


Methods and system for processing dispersible fine powders 


7/13/1999 


5928469 


Process for storage of materials 


7/27/1999 


5976574 


Processes for spray drying hydrophobic drugs in organic solvent suspensions 


11/2/1999 


5985248 


Processes for spray drying solutions of hydrophobic drags and compositions thereof 


11/16/1999 


5994314 


Compositions and methods for nucleic acid delivery to the lung 


11/30/1999 


5997848 


Methods and compositions for pulmonary delivery of insulin 


12/7/1999 


6001336 


Processes for spray drying aqueous suspensions of hydrophobic drugs and compositions thereof 


12/14/1999 


6019968 


Dispersible antibody compositions and methods for their preparation and use 


2/1/2000 


6051256 


Dispersible macromolecule compositions and methods for their preparation and use 


4/18/2000 


6071428 


Stable compositions 


6/6/2000 


6077543 


Systems and processes for spray drying hydrophobic drugs with hydrophilic exripients 


6/20/2000 


6080721 


Pulmonary delivery of active fragments of parathyroid hormone 


6/27/2000 


6089228 


Apparatus and methods for dispersing dry powder medicaments 


7/18/2000 


6103270 


Methods and system for processing dispersible fine powders 


8/15/2000 


6123936 


Methods and compositions for the dry powder formulation of interferons 


9/26/2000 
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6136346 


Powdered pharmaceutical formulations having improved dispersibili ty 


10/24/2000 


6138668 


Method and device for delivering aerosolized medicaments 


10/31/2000 


6165463 


Dispersible antibody compositions and methods for their preparation and use 


12/26/2000 


6182712 


Power filling apparatus and methods for their use 


2/6/2001 


6187344 


Powdered pharmaceutical formulations having improved dispersibility 


2/13/2001 


6207135 


jaseous ndcroparticles for ultrasonic diagnosis and process for their production 


3/27/2001 


6231851 


Methods and compositions for the dry powder formulation of interferons 


5/15/2001 


6257233 


Dry powder dispersing apparatus and methods for their use 


7/10/2001 i 


6258341 


Stable glassy state powder formulations 


7/10/2001 


6267155 


Powder filling systems, apparatus and methods 


7/31/2001 


6294204 


Method of producing morphologically uniform microcapsules and microcapsules produced by 
lus method 


9/25/2001 


6303582 


Compositions and methods for nucleic acid delivery to the lung 


10/16/2001 


6309623 


Stabilized preparations for use in metered dose inhalers 


10/30/2001 


6309671 


Stable glassy state powder formulations. 


10/30/2001 


6358530 


Powdered pharmaceutical formulations having improved dispersibility 


3/19/2002 


6365190 


Systems and processes for spray drying hydrophobic drugs with hydrophilic excipients 


4/2/2002 


6372258 


Methods of spray-drying a drug and a hydrophobic amino acid . 


4/16/2002 


6423344 


Dispersible macromolecule compositions and methods for their preparation and use 


7/23/2002 


6426210 


Storage of materials 


7/30/2002 


6433040 


Stabilized bioactive preparations and methods of use 


8/13/2002 


6440337 


Method and apparatus for the formation of particles 


8/27/2002 


RE37872 


Storage of materials 


10/8/2002 


6479049 


Methods and compositions for the dry powder formulation of interferons 


11/12/2002 


6503411 


Stable compositions 


1/7/2003 . 


6509006 


Devices compositions and methods for the pulmonary delivery of aerosolized medicaments 


1/21/2003 


6514496 


Dispersible antibody compositions and methods for their preparation and use 


2/4/2003 


6518239 


dry powder compositions having improved dispersivity 


2/11/2003 


6543448 


apparatus and methods for dispersing dry powder medicaments 


4/8/2003 


6546929 


dry powder dispersing apparatus and methods for their use 


4/15/2003 


WO 00/15262 


dry powder active agent pulmonary delivery 


3/23/2000 


WO 93/00951 


method and device for delivering aerosolized medicaments 


1/21/1993 


WO 94/07514 


pulmonary delivery of active fragments of parathyroid hormone - ' 


4/14/1994 


WO 95/24183 


tnethods and compositions for pulmonary delivery of insulin 


M4/1995 


WO 95/31479 


methods and compositions for the dry powder formulation of interferons 


11/23/1995 


WO 96/09085 


apparatus and methods for dispersing dry powder medicaments 


3/28/1996 


WO 96/32096 


powdered pharmaceutical formulations having improved dispersibility 


10/17/1996 


WO 96/32116 


compositions and methods for nucleic acid delivery to the lung 


10/17/1996 


WO 96/32149 


pulmonary delivery of aerosolized medicaments 


10/17/1996 


WO 96/32152 


pulmonary administration of dry powder alpha 1-antitrypsin 


10/17/1996 


WO 96/40068 


methods and system for processing dispersible fine powders 


12/19/1996 


WO 97/41031 


powder filling systems, apparatus and methods 


11/6/1997 


WO 97/41833 


dispersible macromolecule compositions and methods for their preparation and use 


11/13/1997 


WO 98/16205 


stable glassy state powder formulations 


4/23/1998 


WO 98/29096 


aerosolized hydrophobic drug 


7/9/1998 


WO 98/29098 


processes for spray drying aqueous suspensions of hydrophobic drugs with hydrophilic 
sxcipients and compositions prepared by such processes 


7/9/1998 


WO 98/29140 


>rocesses and compositions for spray drying hydrophobic drugs in organic solvent suspensions 
of hydrophilic excipients 


7/9/1998 


WO 98/29141 


processes for spray drying solutions of hydrophobic drugs with hydrophilic excipients and 
impositions prepared by such processes 


7/9/1998 


WO 99/19215 


powder filling apparatus and method ' 


4/22/1999 ! 


WO 99/42124 Oiquid crystal forms of cyclosporin 


8Q671999 
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WO 99/47196 


aerosolized active agent delivery 


3/23/1999 


WO 99/62495 


dry powder dispersing apparatus and methods for their use 


12/9/1999 


WO 00/21594 


[low resistance modulated aerosolized active agent delivery 


4/20/2000 


WO 00/61178 


julmonary administration of dry powder formulations for treating infertility 


10/19/2000 


WO 00/72904 


apparatus and method for dispensing metered amount of aerosolized medication 


12/7/2000 


WO 01/00263 


systems and methods for aerosolizing pharmaceutical formulations 


174/2001 


WO 01/00312 


spray drying process for preparing dry powders 


1/4/2001 


WO 01/32144 


dry powder compositions having improved dispersivity 


5/10/2001 


WO 01/43529 


receptacles to facilitate the extraction of powders 


6/21/2001 


WO 01/43530 


systems and methods for extracting powders from receptacles 


6/21/2001 


WO 01/43802 


systems and methods for treating packaged powders 


6/21/2001 I 


WO 01/44764 


systems and methods for non-destructive mass sensing 


6/21/2001 ! 


WO 01/87393 


systems, devices and methods for opening receptacles having a powder to be fhridized 


11/22/2001 


WO 01/93932 


lockout mechanism for aerosol drug delivery devices 


12/13/2001 


WO 02/09669 


apparatus and process to produce particles having a narrow size distribution and particles made 
thereby 


2/7/2002 


WO 02/11695 


inhaleable spray dried 4-hedix bundle protein powders having minimized aggregation 


2/14/2002 


WO 02/49619 


nduced phase transition method for the production of nricroparticles containing hydrophilic 
active agents 


6/27/2002 


WO 02/49620 


induced phase transition method for the production of microparticles containing hydrophobic 
active agents 


6/27/2002 


WO 02/54868 


pulmonary delivery of polyene antifungal agents 


7/18/2002 


WO 02/87542 


novel methods and compositions for delivering macromolecules to or via the respiratory tract 


1177/2002 


WO 02/100548 


centrifuged rotating drum for treating cohesive powders * • 


12/19/2002 


WO 03/00326 


powder aerosolization apparatus and method 


1/3/2003 


WO 03/00329 


low regulator for aerosol drug delivery device and methods 


1/3/2003 



TABLE 4: Forward and reverse primers for nucleic add amplification of SARSV 



Pair Number 


Forward primer SEQ ID NO 


Forward Primer Start 


Forward Primer Stop 


Forward Primer Tm 


Forward Primer %GC 


Reverse primer SEC) 10 NO 


Reverse Primer Start 


Reverse Primer Stop 


Reverse Primer Tm 


Reverse Primer %GC 


Primer TmDiff 


Product Length 


Product Tm 


Product %GC 


Anneal Score 


Optimum Anneal Temp 


1 


1021 


12726 


12746 


51.3 


47.6 


3521 


12996 


12977 


502 


40 


1 


271 


75 


42.8 


26 


52.6 


2 


1022 


12236 


12256 


512 


42.9 


3522 


( 12993 


12975 


51.4 


47.4 


02 


758 


76.4 


425 


26 


54 


3 


1023 


12373 


12391 


50.8 


47.4 


3523 


12993 


12975 


51.4 


47.4 


0.6 


621 


76.4 


43 


26 


53.8 


4 


1024 


12236 


12256 


512 


42.9 


3524 


12996 


12977 


502 


40 


0.9 


761 


76.4 


42.3 


26 


53.6 


5 


1025 


12373 


12391 


50.8 


47.4 


3525 


12996 


12977 


502 


40 


0.5 


624 


76.4 


42.8 


26 


53.6 


6 


1026 


12726 


12746 


51.3 


47.6 


3526 


12993 


12975 


51.4 


47.4 


0.1 


268 


75.1 


43.3 


26 


53.1 


: 7 


1027 


2671 


2692 


52.1 


40.9 


3527 


3185 


3164 


51 


455 


12 


515 


75.6 


41.6 


26 


53.3 


i s 


1028 


28942 


28961 


502 


45 


3528 


29298 


29280 


51.4 


52.6 


12 


357 


76.4 


44.8 


26 


53.6 


9 


1029 


19801 


19819 


532 


52.6 


3529 


19922 


19901 


515 


455 


1.7 


122 


722 


43.4 


26 


51.1 


10 


1030 


19800 


19817 


50.4 


50 


3530 


19921 


19901 


502 


47.6 


0.3 


122 


722 


43.4 


26 


50.7 


11 


1031 


9930 


9946 


515 


52.6 


3531 


10605 


10588 


51.1 


50 


0.4 


676 


75.8 


412 


27 


53.5 


12 


1032 


9933 


9952 


50.9 


45 


3532 


10605 


10588 


51.1 


50 


02 


673 


75.8 


412 


27 


53.4 


13 


1033 


9930 


9949 


522 


50 


3533 


10605 


10588 


51.1 


50 


1.1 


676 


75.8 


412 


27 


53.5 
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14 


1034 


9927 


9945 


50.8 


52.6 


3534 


10605 


10588 


51.1 


50 


02 


679 


75.8 


412 


28 


53.4 


15 


1035 


3789 


3806 


50 


50 


3535 


4445 


4425 


50.6 


42.9 


05 


657 


755 


405 


28 


529 


16 


1036 


3788 


3805 


50 


50 


3538 


4444 


4424 


50.6 


429 


05 


657 


755 


405 


28 


52.9 


17 


1037 


3795 


3813 


52.1 


52.6 


3537 


4445 


4425 


50.6 


42.9 


15 


651 


755 


40.6 


28 


531 


18 


1038 


3787 


3804 


50 


50 


3538 


4445 


4425 


50.6 


429 


05 


659 


75.4 


40.4 


28 


529 


| 19 


1039 


19801 


19819 


532 


52.6 


3539 


19921 


19900 


51.8 


455 


1.4 


121 


723 


438 


28 


512 


20 


1040 


24418 


! 24436 


50 


47.4 


3540 


25182 


25164 


51.4 


47.4 


1.4 


765 


76.1 


41.7 


28 


534 


21 


1041 


9929 


9949 


53.8 


47.6 


3541 


10449 


10425 


54.6 


40 


0.8 


521 


75.4 


40.9 


28 


54 


22 


1042 


2671 


2692 


52.1 


40.9 


3542 


3186 


3165 


50.4 


40.9 


1.7 


516 


75.6 


415 


28 


53.1 


23 


1043 


3792 


! 3810 


52.9 


52.6 


3543 


4446 


4425 


51.8 


455 


1.1 


655 


75.5 


40.6 


28 


53.5 


24 


1044 


9933 


9952 


50.9 


45 


3544 


10449 


10431 


50.9 


47.4 


0.1 


517 


75.3 


40.8 


28 


531 


25 


1045 


3792 


3810 


52.9 


52.6 


3545 


4445 


4424 


51.3 


40.9 


1.6 


654 


755 


405 


28 


533 


26 


1046 


25782 


25806 


535 


40 


3546 


26184 


26164 


52.4 


42.9 


1.1 


j 403 


74.7 


402 


28 


531 


27 


1047 


9927 


9945 


50.8 


52.6 


3547 


10449 


10431 


50.9 


47.4 


0.1 


523 


75.4 


40.9 


28 


531 


28 


1048 


9927 


! 9945 


50.8 


52.6 


3548 


10449 


10428 


51.9 


40.9 


1.1 


523 


75.4 


40.9 


28 


53.1 


29 


1049 


3789 


I 3806 


50 


50 


3549 


4444 


4424 


50.6 


42.9 


05 


656 


755 


405 


28 


53 


30 


1050 


3795 


3813 


52.1 


52.6 


3550 


4444 


4424 


50.6 


429 


15 


650 


755 


40.6 


28 


53.1 


31 


1051 


9933 


9952 


50.9 


45 


3551 


10449 


10428 


51.9 


40.9 


1.1 


517 


752 


' 40.8 


28 


53.1 


32 


1052 


9930 


9948 


51.5 


52.6 


3552 


10449 


10431 


50.9 


47.4 


05 


520 


75.4 


. 41 


28 


532 


33 


1053 


9930 


9948 


51.5 


52.6 


3553- 


• 10449 


10428 


51.9 


40.9 


0.4 


520 


75.4 


41 


28 


533 


34 


1054 


9929 


9948 


53.2 


50 


3554 


10449 


10425 


54.6 


40 


1.4 


521 


75.4 


40.9 


28 


53.8 


35 


1055 


9931 


9952 


53 


45.5 


3555 


10449 


10425 


54.6 


40 


1.6 


519 


753 


40.8 


28 


53.7 


36 


1056 


3791 


3808 


50 


50 


3556 


4445 


4425 


50.6 


42.9 


05 


655 


75.5 


405 


28 


529 


37 


1057 


3791 


3808 


50 


50 


3557 


4444 


4424 


50.6 


42.9 


0.5 


654 


755 


405 


28 


53 


38 


1058 


9930 


9949 


52.2 


50 


3558 


10449 


10431 


50.9 


47.4 


12 


520 


75.4 


41 


28 


532 


39 


1059 


9930 


9949 


522 


50 


3559 


10449 


10428 


51.9 


40.9 


0.3 


520 


75.4 


■ 41 


28 


53.5 


40 


1060 


3788 


3805 


50 


50 


3560 


4445 


4425 


50.6 


42.9 


05 


658 


75.5 


• 40.4 


28 


52.9 


41 


1061 


19800 


19817 


50.4 


50 


3561 


19921 


19900 


51.8 


45.5 


1.4 


122 


722 


1.43.4 


28 


•50.8 


42 


1062 


3787 


3804 


50 


50 


3562 


4444 


4424 


50.6 


42.9 


05 


658 


755 


• 40.4 


28 


529 


43 


1063 


25782 


25806 


535 


40 


3563 


26183 


26163 


51.7 


42.9 


1.7 


402 


74.7 


40.3 


28 


52.9 


44 


1064 


25782 


25806 


53.5 


40 


3564 


26183 


26160 


545 


41.7 


1 


402 


74.7 


40.3 


28 


535 


45 


1065 


25782 


25806 


53.5 


40 


3565 


26183 


26159 


54.9 


40 


15 


402 


74.7 


• 40.3 


28 


: 53.5 


46 


1066 


2429 


2447 


502 


47.4 


3566 


3187 


3166 


50.3 


455 


0.1 


759 


76.6 


43 


29 


53.8 


47 


1067 


2427 


2445 


52.1 


52.6 


3567 


3185 


3164 


51 


45.5 


1.1 


759 


76.7 


-.43.1 


29 


54.1 


48 


1068 


2429 


2447 


502 


47.4 


3568 


3185 


3164 


51 


455 


0.7 


757 


76.6 


429 


29 


53.8 


49 


1069 


19800 


19817 


50.4 


50 


3569 


19923 


19904 


50.1 


50 


0.3 


124 


72.3 


435 


29 


50.8 


50 


1070 


2427 


2445 


52.1 


52.6 


3570 


3187 


3166 


50.3 


45.5 


1.8 


761 


76.7 


43.1 


29 


53.9 


51 


1071 


29183 


29204 


50.4 


40.9 


3571 


29412 


29393 


50.3 


45 


0 


230 


75.3 


44.8 


29 


529 


52 


1072 


16367 


16386 


51.4 


50 


3572 


16780 


16760 


51.4 


429 


0.1 


414 


75 


40.8 


30 


53 


53 


1073 


11543 


11562 


50.4 


40 


3573 


12254 


12236 


50.5 


47.4 


0.1 


712 


762 


42 


30 


53.6 


j 54 


1074 


12976 


12995 


51.1 


45 


3574 


13547 


13528 


502 


45 


0.9 


572 


77.4 


455 


30 


54.3 


55 


1075 


12040 


12057 


50.6 


50 


3575 


12254 


12236 


50.5 


47.4 


0.1 


215 


75.5 


45.6 


30 


53.1 


56 


1076 


12976 


12996 


51.8 


42.9 


3576 


13544 


13525 


526 


55 


0.8 


569 


77.5 


45.7 


30 


54.8 


57 


1077 


10141 


10160 


51 


45 


3577 


10605 


10588 


51.1 


50 


0.1 


465 


74.9 


402 


30 


528 


58 


1078 


12235 


12253 


50.1 


52.6 


3578 


12996 


12977 


502 


40 


0.1 


762 


76.4 


424 


30 


53.6 


59 


1079 


19795 


19814 


50.4 


45 


3579 


19921 


19901 


502 


47.6 


0.3 


127 


723 


433 


30 


50.8 


on 

60 


4AAA 

lOoO 


12235 


12253 


50.1 


52.6 


3580 


12993 


12975 


51.4 


47.4 


1.3 


759 


765 


426 


30 


53.7 


61 


1081 


12976 


12994 


50.3 


47.4 


3581 


13547 


13528 


502 


45 


0.1 


572 


77.4 


45.5 


30 


54.3 


62 


1082 


12975 


12994 


52.1 


45 


3582 


13544 


13525 


52.6 


55 


05 


570 


77.4 


45.6 


30 


54.9 


63 


1083 


12977 


12996 


502 


40 


3583 


13547 


13528 


502 


45 


0 


571 


772 


45.4 


30 


54.3 


64 


1084 


11541 


11561 


50.9 


42.9 


3584 


12254 


12236 


50.5 


47.4 


0.3 


714 


762 


42 


30 


53.6 


65 


1085 


28394 


28411 


502- 


50 


3585 


28672 


28654 


50.6 


526 


0.3 


279 


78.6 


51.6 


30 


552 
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66 
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75.3 


48.3 


42 


53 


2475 




2522 


2541 

cot 1 


51 4 

O l.*T 


i 4c 

HO 


5QQ5 


cO/0 


cOOO 


0U.4 


CA 
OU 


A 

i 


1 CA 

154 


752 


48.1 


42 


52.9 


2476 


utvU 


2429 


2447 


50 9 


47 4 


5QQ8 


once 
oUOO 


0007 
0UO/ 


CO 1 
OC.1 


cc 

00 


1 0 

1.9 


628 


7A f% 

753 


42.7 


42 


53.6 


2477 


1497 


2429 


2447 


50 9 


47 4 


5QQ7 
oyy/ 


OI9U 


01 CO 

oioy 


cn 7 
OU./ 


/c c 
40.0 


A C 


762 


76.6 


42.9 


42 


53.8 


947R 


O40Q 


C/ *fOU 


97455 
C/*tOO 


597 
OCr 


45 


COOP. 

0330 


07C/M 


07C01 

c/ocl 


C1 7 

51.7 


47.0 


1 


106 


72.1 


45.3 


42 


51.1 


247Q 


Q/QQ 
0*KJ»7 


0A9Q 


9447 


en 9 
ou«c 


47 4 




OH DO 
olyc 


0171 
ol/l 


C1 0 

51 .9 


CA 

OU 


4 -7 

1.7 


764 


76.7 


43.1 


42 


53.8 


2480 


**5fl0 


9497 


9445 


59 1 

OC 1 


59 8 
Oc.0 


OUUU 


OUO0 


0A07 


cn 1 

0C.1 


cc 
00 


0 


630 


76.4 


42.9 


42 


542 


2481 


OOv l 


Cf ooo 


974fi7 
C/ *H/# 


50 8 

OU.O 


47 4 


OUUI 


c/041 


07CO1 

c/Ocl 


C1 "7 
01./ 


47.0 


1.1 


4 ro 

153 


732 


43.1 


42 


51.5 


2482 


3502 


2427 


2445 

CltO 


59 1 

OC. 1 


59 ft 
Oc.0 


paw? 
Owe 


oion 


01 CO 

oioy 


CA 7 
50.7 


AC C 

40.0 


1.4 


764 


76.7 


43.1 


42 


54 


2483 


QCAQ 


18816 


18838 


51 4 

O l.*T 


47 8 
*»/ .O 


DUUO 


IQOIR 


lycyO 


cn 
ou 


Af\ O 


1 A 

1.4 


701 


75.4 


402 


42 


52.9 


2484 


3504 


9077 
CO/ / 


CO&O 


59 4 

OC.H 


59 ft 
oc.D 


DUU4 


cOf c 


COOO 


C1 c 

01.0 


CA 


0.8 


296 


77 


47.3 


42 


54.5 


2485 


3505 

OOUO 


tooy i 


18811 


51 7 

o I ./ 


49 Q 


DUUO 


1001ft 


lyiyo 


CA O 

50.2 


Aft ft 

40.9 


1.4 


AAA 

626 


75.7 


41.1 


42 


53.1 


2486 


3506 




12384 


51 7 

O I./ 


59 fi 
oc.D 


DUUO 


1970Q 


lc/lO 


C1 
01 


Aft A 

40.9 


A "7 

0.7 


374 


75.6 


42.8 


42 


53.3 


2487 


3507 

OOvf 


2377 


23Q5 

CO^O 


59 4 
oC*r 


59 6 
oc.D 


OUUf 


OC70 
cO/c 


CD04 


CA O 

oO.y 


co a 
Oc.0 


1 c 

1.5 


296 


77 


47.3 


42 


54.3 


2488 


3508 


16982 


17001 


51 9 


55 

oo 


8008 
DUUO 


17111 
1/11 1 


1 /uyu 


C1 1 
01.1 


Aft ft 

40.9 


A 1 
0.1 


4 OA 

130 


74.6 


48.5 


42 


52.6 


2489 


3509 


2377 


23.05 
cow? 


59 4 

oct 


59 8 
oc.D 


DUU%7 


cO/0 


COOO 


CA A 

0U.4 


CA 

oO 


2 


AAA 

299 


77 


472 


42 


54.1 


2490 


3510 


18590 


18808 


50 8 

OU.D 


49 1 


OUIU 


1 be 10 


lyiyo 


CA O 

OU.c 


Af\ ft 

40.9 


A O 

0.3 


A/V7 

627 


75.0 


41 


42 


53.1 


2491 


3511 


2377 


2395 


594 


59 6 
ocu 


8011 

QUI 1 


9RQ1 

coyi 


0070 


CA Q 
OU.O 


A7 A 

4/.4 


1 e 
1.0 


C1C 

515 


76.8 


44 c 

44.5 


42 


54.1 


2492 


3512 


8220 


8240 
oc*tv 


54 

ot 


47 8 

*♦/ .D 


8019 
OU lc 




on 17 
oy 1 / 


C/l C 

04.0 


CO c 
Oc.0 


A >l 

0.4 


716 


75.4 


40.1 


42 


54.1 


2493 


3513 


12370 


12388 

I COOO 


50 1 


47 4 


DUIO 


19000 
IcyyO 


1007Q 


CA 1 
0U.1 


AC 

45 


A 1 
0.1 


con 

629 


76.4 


42.9 


42 


53.6 


2494 


3514 


999fl 


2244 


51 4 

O 1 »*T 


45 5 


R014 

OU If 


9R75 
CD/O 


COOO 


CA A 

0U.4 


CA 
OU 


4 
1 


jrn 

453 


77 


45.3 


42 


54.1 


2495 


3515 


2220 


2239 


51.3 


45 


6015 


2672 


2654 


50.9 


52.6 


0.4 


453 
too 


77 
» / 


45 3 


49 
*#c 


549 

0*r-C 


2496 


3516 


24418 


24439 


52.9 


45.5 


0)16 


24936 


24919 


51.8 


50 


1.1 


519 


76 


42.4 


42 


53.8 


2497 


3517 


18586 


18603 


50.4 


44.4 


6017 


19216 


19195 


50J2 


40.9 


02 


.631 


75.6 


40.9 


42 


53.1 


2498 


3518 


2220 


2239 


51.3 


45 


6018 


2675 


2656 


50.4 


50 


0.8 


456 


76.9 


452 


42 


54.1 


2499 


3519 


1402 


1422 


502 


42.9 


6019 


2153 


2134 


50.4 


45 


02 


752 


76.7 


43.1 


42 


sas 


2500 


3520 


1356 


1375 


53.8 


55 


6020 


2153 


2133 


5Z1 


42.9 


1.7 


798 


76.9 


43.5 


42 


545 



Table 5: Primers 



Forwai 
ID N0< 


rd primer SEQ 
i Co-ordinates 


Revere 
ID NO i 


« primer SEQ 
fc Co-ordinates 


T M (FOR & REV) (°C) 


Product 
length (bp) 


6076 


1-19 


6171 


199-183 


50.1 


50.3 


199 


6077 


149-169 


6172 


334-315 


51.5 


52.4 


186 


6078 


292-310 


6173 


560-541 


50.8 


51.1 


269 


6079 


598-619 


6174 


749-731 


52.6 


50.6 


152 


6080 


721-742 


6175 


930-912 


50.4 


50.3 


210 


6081 


888-912 


6176 


1077-1058 


52.8 


51.2 


190 


6082 


984-1003 


6177 


1149-1131 


51.1 


51.1 


166 



-387- 



wo 



2004/092360 



PCT/US2004/011710 



6083 


1157-1175 


6178 


1479-1460 


50.9 


51.6 


323 


6084 


1420-1441 


6179 


1700-1680 


51.2 


50.7 


281 


6085 


1685-1707 ] 


6180 


1834-1811 


53.8 


53.7 


150 


6086 


1740-1764 


6181 


1987-1963 


53.4 


52.2 


248 


6087 


2007-2025 


6182 


2251-2232 


50.3 


50.1 


245 


6088 


2226-2245 


6183 


2385-2366 


50.4 


50.1 


160 


6089 


2428-2446 


6184 


2749-2728 


50.1 


50.3 


322 


6090 


2742-2763 


6185 


2893-2875 


50.6 


51.4 


152 


6091 


2823-2844 


6186 


3082-3058 


50.4 


52.3 


260 


6092 


3007-3031 


6187 


3185-3164 


51.9 


51.0 


179 


6093 


3234-3254 


6188 


3497-3478 


51.1 


51.3 


264 


6094 


3453-3476 j 


6189 


3647-3627 


51.8 


52.1 


195 


6095 


3601-3622 ! 


6190 


3877-3853 


52.5 


53.6 


277 


6096 


4007-4027 


6191 


4158-4135 


51.1 


51.4 


152 


6097 


4141-4165 


6192 


4316-4295 


51.3 


50.8 


176 


6098 


4366-4387 


6193 


4567-4544 


54.6 


55.4 


202 


6099 


• 4488-4508 


6194 


4708-4690 


50.7 


50.3 


221 ■ . 


6100 


4658-4677 


6195 


4994-4974 


50.5 


51:2 


337 


6101 


4902-4922 


6196 


5115-5092 


50.5 


51.4 


214 


6102 


5239-5260 


6197 


5450-5430 


50.8 


50.9 


212 


6103 


5366-5389 


6198 


5560-5542 


50.5 


51.8 


195 


6104 


5593-5612 


6199 


5860-5836 


50.8 


51.6 


268 


6105 


.6042-6062 


6200 


6291-6271 


50.4 


51.1 


. 250 


6106 


6271-6291 


6201 


6483-6463 


51.1 


50.2 


213 


6107 


7017-7040 


6202 


7171-7153 


52.4 


52.8 


155 


6108 


7253-7272 


6203 


7504-7486 


50.3 


50.3 


252 


6109 


7415-7434 


6204 


. 7677-7654 


54.5 


53.6 


263 


6110 


7615-7635 


6205 


7821-7798 


51.1 


52.8 


207 


6111 


7728-7746 


6206 


7936-7915 


51.7 


50.1 


209 


6112 


7845-7867 


6207 


7994-7970 


52.7 


53.4 


150 


6113 


8011-8029 


6208 


8189-8170 


51.4 


50.6 


179 


6114 


8143-8166 


6209 


8300-8281 


52.2 


50.8 


158 


6115 


8221-8239 


6210 


8388-8369 


51.0 


51.1 


168 


6116 


8553-8575 


6211 


8931-8915 


51.8 


50.3 


379 


6117 


8867-8886 


6212 


9254-9236 


50.7 


50.6 


388 
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6118 


9244-9267 


6213 


9597-9573 


51.9 


53.4 


354 


6119 


9620-9640 


6214 


9990-9969 


51.3 


51.3 


371 


6120 


10009-10027 


6215 


10188-10171 


50.2 


50.2 


180 


6121 


10093-10113 


6216 


10244-10223 


52.4 


50.6 


152 


j 6122 


10242-10265 


6217 


10608-10589 


51.2 


51.0 


367 


6123 


10549-10571 


6218 


10783-10763 


53.7 


55.2 


235 


6124 


10766-10785 


6219 


10930-10912 


52.0 


51.1 


165 


6125 


11065-11085 


6220 


11305-11287 


50.7 


50.0 


241 


6126 


11265-11287 


6221 


11429-11405 


54.5 


53.5 


165 


6127 


11552-11571 


6222 


11730-11709 


52.0 


50.4 


179 


6128 


11705-11726 


6223 


11869-11848 


50.1 


50.2 


165 


6129 


11801-11824 


6224 


11984-11967 


51.5 


50.4 


184 


6130 


12040-12058 


' 6225 


12254-12235 


52.3 


51.9 


215 


6131 


12235-12253 


6226 


12406-12388 


50.1. 


50.1 


- 172 


6132 


12366-12384 


6227 


12730-12712 


51.7 


52.2 


365 


.6133 


12727-12748 


6228 


12994-12976 


50.8 


50.3 


268 


6134 


12948-12966 


. 6229 . 


13224-13201 


50.7 


51.7 


277 


6135 


.13175-13196 


6230 


13324-13300 


54.3 


55.1 


" 150 


6136 


13237-13258 


6231 


13545-13526 


52.9 


52.9 


309 


6137 


13790-13810 


6232 


13963-13945 


50.9 


50.7 


174 


6138 


14080-14098 


6233 


14280-14257 


. 51.5 


51.0 


201 


,6139 


14405-14427 


6234 


14561-14540 


50.2 


50.9 


157 


.6140 


14882-14906 


6235 


15046-15024 


50.9 


51.5 


165 j 


6141 ; 


14951-14976 


6236 


15145-15124 


53.1 


52.9 


195 


6142 


15113-15134 


6237 


15275-15257 


51.6 


50.8 


163 


6143 


15211-15230 


6238 


15383-15363 


50.2 


50.1 


173 


6144 


15364-15387 


6239 


15528-15506 


54.0 


52.1 


165 


6145 


15456-15477 


6240 


15605-15585 


52.0 


53.2 


150 


6146 


15513-15532 


6241 


15897-15876 


51.2 


50.4 


385 


6147 


15837-15856 


6242 


15999-15978 


52.3 


50.8 


163 


6148 


16073-16096 


6243 


16301-16277 


51.7 


52.8 


229 


6149 


16245-16266 


6244 


16404-16380 


50.3 


52.0 


160 


6150 


16366-16385 


6245 


16515-16492 


52.9 


53.8 


150 


6151 


16553-16571 


6246 


16777-16758 


53.4 


51.5 


225 


6152 


16832-16852 


6247 


17026-17004 


51.0 


51.6 


195 
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6153 


16982-17001 


6248 


17359-17340 


515 


50.2 


378 


6154 


17354-17372 


6249 


17511-17490 


51.3 


50.4 


158 


6155 


17422-17443 


6250 


17573-17552 


50.2 


51.1 


152 


6156 


17603-17623 


6251 


17769-17748 


50.7 


51.5 


167 


6157 


17728-17746 


6252 


17883-17862 


50.9 


515 


156 


6158 


18011-18030 


6253 


18163-18140 


52.9 


51.9 


153 


6159 


18076-18098 


6254 


18225-18205 


54.4 


55.0 


150 


6160 


18270-18292 


6255 


18432-18413 


51.9 


51.4 


163 


6161 


18352-18373 


6256 


18648-18629 


51.3 


50.8 


297 


6162 


18550-18571 


6257 


18702-18684 


50.4 


51.9 


153 


6163 


18720-18738 


6258 


19004-18983 


50.6 


51.0 


285 


6164 


18960-18981 


6259 


19109-19085 


54.7 


54.3 


150 


6165 


19065-19089 


6260 


19217-19195 


52.8 


51.7 


153 


.6166 


19310-19329 


6261 


19476-19454 


50.2 


52.1 


. 167 


. 6167 


19569-19589 


6262 


19719-19701 


50.5 


51.8 


151 


6168 


19707-19731 


6263 


19856-19833 


55.7 


55.9 


- 150 •-• 


6169 


19771-19792 


6264 


19921-19901 


50.1 


505 


151 


6170 


19833-19851 


6265 


19986-19966 


50.9 


50.7 


154 


'able 6: Primers 


Forward primer SEQ 
ID NO & Co-ordinates 


Revers 
ID UO I 


te primer SEQ 
it Co-ordinates 


T M (FOR & REV) (°C) 


Product 
length (bp) 


6266 


20110-20132 


6305 


20425-20404 


51.9 


50.9 


316 


6267 


20468-20492 


6306 


20617-20596 


53.2 


53.5 


150 


6268 


20557-20578 


6307 


20891-20871 


50.4 


50.6 


335 


6269 


20838-20856 


6308 


21037-21015 


52.5 


52.0 


200 


6270 


21096-21116 


6309 


21295-21272 


50.1 


51.7 


200 


6271 


22173-22194 


6310 


22414-22395 


52.4 


51.0 


242 


6272 


22320-22342 


6311 


22501-22479 


54.8 


54.3 


182 


6273 


22532-22552 


6312 


22695-22675 


50.6 


50.0 


164 


6274 


22712-22736 


6313 


22873-22852 


56.7 


55.5 


162 


6275 


22842-22861 


6314 


23086-23067 


51.0 


52.8 


245 


6276 


23151-23170 


6315 


23395-23376 


51.4 


50.3 


245 


6277 


23307-23326 


6316 


23524-23501 


51.1 


51.1 


218 


6278 


23615-23635 


6317 


23776-23758 


50.7 


50.2 


162 
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6279 


23838-23857 


6318 


23996-23977 


50.4 


50.6 


159 


6280 


24030-24051 


6319 


24407-24386 


57.6 


55.7 


378 


6281 


24388-24407 


6320 


24581-24563 


50.4 


50.1 


194 


6282 


24559-24579 


6321 


24938-24921 


52.0 


50.4 


380 


6283 


24922-24941 


6322 


25184-25166 


50.1 


51.2 


263 


6284 


25201-25220 


6323 


25400-25382 


51.1 


51.4 


200 


6285 


25363-25381 


6324 


25646-25627 


51.1 


50.5 


284 


6286 


25656-25681 


6325 


25839-25814 


54.5 


56.4 


184 


6287 


25761-25782 


6326 


25982-25961 


54.6 


54.3 


222 


6288 


26039-26058 


6327 


26189-26166 


54.0 


53.0 


151 


6289 


26184-26205 


6328 


26333-26310 


50.9 


51.8 


150 


6290 


26422-26442 


6329 


26660-26641 


51.3: 


50.2 


239 


6291 


26571-26589 


6330 


26739-26715 


51.7 


53.2 


169 


6292 


26733-26752 


. 6331 


26960-26941 


51.1- 


52.2 


228 


6293 


26866-26885 


6332 


27139-27117 


50.7 


51.9 


274 


6294 


27300-27321 


6333. 


27458-27439 


51.2 


50.2 


159 


6295 


27361-27380 


. 6334 : 


27579-27558 


52.4 


51.1 


219 


6296 


27718-27740 


• 6335 


27917-27901 


50.7 


50.0 


200 


6297 


28041-28059 


6336 


28207-28189 


50.8 


50.8 


167 


6298 


28166-28189 


6337 


28411-28393 


52.2 


52.9 


246 


6299 


28395-28414 


6338 


28671-28653 


51.5, 


50.2 


277 


6300 


28654-28672 


6339 


28821-28800 


50.6 


52.3 


168 


6301 


28867-28885 


6340 


29184-29166 


51.5 


51.6 


318 


6302 


29183-29204 


6341 


29360-29342 


50.4 


50.4 


178 


6303 


29262-29279 


6342 


29626-29606 


50.1 


50.2 


365 


6304 


29538-29559 


6343 


29690-29670 


50.0 


50.4 


153 



Table 7: Primers 



Name 


SEQ ID NO: 


Co-ordinates 




Name 


SEQ ID NO: 


Co-ordinates 


AB4f 


6344 


19869-19888 




CB1r 


6367 


28011-28030 


AB5f 


6345 


20238-20257 




CB2r 


6368 


27671-27690 


BC1f 


6346 


20581-20600 




CB3r 


6369 


27301-27320 


BC2f 


6347 


20950-20969 




CB4r 


6370 


26931-26950 


BC3f 


6348 


21339-21358 




CB5r 


6371 


26575-26594 


BC4f 


6349 


21708-21727 




CB6r 


6372 


26191-26210 


BC5f 


6350 


22041-22060 




CB7r 


6373 


25841-25860 


BC6f 


6351 


22410-22429 




CB8r 


6374 


25476-25495 


BC7f 


6352 


22759-22778 




CB9r 


6375 


25126-25145 
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BC8f 


6353 


23131-23150 




CB10r 


6376 


24791-24810 


BC9f 


6354 


23500-23519 




CB11r 


6377 


24422-24441 


BC10f 


6355 


23841-23860 




CB12r 


6378 


24031-24050 


BC11f 


6356 


24210-24229 




CB13r 


6379 


23673-23692 


BC12f 


6357 


24560-24579 




CB14r 


6380 


23298-23317 


BC13f 


6358 


24941-24960 




CB15r 


6381 


22928-22947 


BC14f 


6359 


25310-25329 




CB16r 


6382 


22567-22586 


BC15f 


6360 


25675-25694 




CB17r 


6383 


22196-22215 


BC16f 


6361 


26044-26063 




CB18r 


6384 


21831-21850 


BC17f 


6362 


26413-26432 




CB19r 


6385 


21431-21450 


BC18f 


6363 


26763-26782 




CB20r 


6386 


21073-21092 


BC19f 


6364 


27132-27151 




CB21r 


6387 


20715-20734 


BC20f 


6365 


27491-27510 




BA1r 


6388 


20345-20364 


BC21f 


6366 


27845-27864 




BA2r 


6389 


19969-19988 










BA3r 


6390 


19599-19618 










BA4r 


6391 


19228-19247 










BA5r 


6392 


18852-18871 



Table 8: Primers 

Name SEQ ID NO 



F1 . 


6393 


F2 


6394 


F3 


6395 


F4 


6396 


F5 


6397 


F6 


6398 


F7 


6399 


F8 


6400 


F9 


6401 


F10 


6402 


F11 


6403 


F12 


6404 


F13 


6405 


F14 


6406 


F15 


6407 


F16 


6408 


F17 


6409 


F18 


6410 


F19 


6411 


F20 


6412 


F21 


6413 


F22 


6414 


F23 


6415 


F24 


6416 


F25 


6417 


F26 


6418 


F27 


6419 


F28 


6420 


F29 


6421 


F30 


6422 


F31 


6423 



Co-ordinates 


Name 


1-19 


• R1 


292-310 


R2 


721-742 


R3 * 


984r1003 


R4 


1420-1441 


R5 


1740-1764 


R6 


2226-2245 


. R7 . 


2742-2763 


: R8 


3007-3031 


: R9 


3453-3476 


R10 


4007-4027 


R11 


4366-4387 


R12 


46S&4677 


R13 


5239-5260 


R14 


5593-5612 


R15 


6271-6291 


R16 


7253-7272 


R17 


7615-7635 


R18 


7845-7867 


R19 


8143-8166 


R20 


8553-8575 


R21 


9244-9267 


R22 


10009-10027 


R23 


10242-10265 


R24 


10766-10785 


R25 


11265-11287 


R26 


11705-11726 


R27 


12040-12058 


R28 


12366-12384 


R29 


12948-12966 


R30 


13237-13258 


R31 



SEQ ID NO Co-ordinates 



6441 


334-315 


6442 


749-731 


6443 


1077-1058 


6444 


1479-1460 


6445 


1834-1811 


6446 


2251-2232 


6447 


2749-2728 


6448 


3082-3058 


6449 


3497-3478 


6450 


3877-3853 


6451 


431 64295 


6452 


4708-4690 


6453 


5115-5092 


6454 


5560-5542 


6455 


6291-6271 


6456 


7171-7153 


6457 


7677-7654 


6458 


7936-7915 


6459 


8189-8170 


6460 


8388-8369 


6461 


9254-9236 


6462 


9990-9969 


6463 


10244-10223 


6464 


10783-10763 


6465 


11305-11287 


6466 


11730-11709 


6467 


11984-11967 


6468 


12406-12388 


6469 


12994-12976 


6470 


13324-13300 


6471 


13963-13945 
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F32 


6424 


14080-14098 


F33 


6425 


14882-14906 


F34 


6426 


15113-15134 


F35 


6427 


15364-15387 


F36 


6428 


15513-15532 


F37 


6429 


16073-16096 


F38 


6430 


16366-16385 


F39 


6431 


16832-16852 


F40 


6432 


17354-17372 


F41 


6433 


17603-17623 


F42 


6434 


18011-18030 


F43 


6435 


18270-18292 


F44 


6436 


18550-18571 


F45 


6437 


18960-18981 


F46 


6438 


19310-19329 


F47 


6439 


19707-19731 


F48 


6440 


19833-19851 



Table 9: Primers 





Name 


SEQ ID NO: 


1 


CB12R 


6488 


2 


R0010 


6489 


3 


R0011 


. 6490 


4 


R0012 


6491 


5 


BNI-ED 


6492 


6 


BNI-EU 


6493 


7 


SAR1S-U 


6494 


8 


SARIAs-D 


6495 


9 


SAR1S 


6496 


10 


SARIAs 


6497 


11 


IN2-U 


6498 


12 


IN4-D 


6499 


13 


IN-2 


6500 


14 


IN-4 


6501 


15 


IN-6 


6502 


16 


IN-7 


6503 


17 


C0R1-U 


6504 


18 


C0R2-D 


6505 


19 


COR-1 


6506 


20 


COR-2 


6507 


21 


HKUF-U 


6508 


22 


HKUR-D 


6509 


23 


HKU-F 


6510 


24 


HKU-R 


6511 


25 


1451-D 


6512 i 


26 


1451-U 


6513 I 


27 


690-D 


6514 I 


28 


690-U 


6515 i 


29 


690-D2 


6516 I 



R32 


6472 


14561-14540 


R33 


6473 


15145-15124 


R34 


6474 


15383-15363 


R35 


6475 


15605-15585 


R36 


6476 


15999-15978 


R37 


6477 


16404-16380 


R38 


6478 


16777-16758 


R39 


6479 


17359-17340 


R40 


6480 


17573-17552 


R41 


6481 


17883-17862 


R42 


6482 


18225-18205 


R43 


6483 


18648-18629 


R44 


6484 


19004-18983 


R45 


6485 


19217-19195 


R46 


6486 


19719-19701 


CM "7 


0*KJ/ 


i yy<d i -1 yyu i 




Name 


SEQ ID NO*' 


37 


EMC8-D2 


6524 


38 


EMC8-U2 


6525 * 


39 


EMC11-D 


6526 ' 


40 


EMC11-U 


6527 • 


41 


ORF1 B-D 


6528 


42 


ORF1B-U 


6529 


43 

TV 


ORFS-D 


6530 

www 


44 


ORFS-U 


6531 


45 


E7-717F 


6532 


46 


E8-85R 


6533 


47 


E8-307F 


6534 


48 


E1 1-771 F 


6535 


49 


E1 1 -96R 


6536 


50 


CON1-F 


6537 


51 


CON1-U 


6538 


52 


CON2-F 


6539 


53 


CON2-R 


6540 


54 


CON3-F 


6541 


55 


CON3-R 


6542 


56 


15-F 


6543 


57 


15-R 


6544 


58 


15-F2 


6545 


59 


15-R2 


6546 


60 


13-F 


6547 


61 


13-R 


6548 


62 


13-F2 


6549 


63 


13-R2 


6550 


64 


CONTIG-F 


6551 


65 


QT3-R 


6552 
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30 


690-U2 


6517 


31 


EMC7-D 


6518 


32 


EMC7-U 


6519 


33 


EMC7-D2 


6520 


34 


EMC7-U2 


6521 


35 


EMC8-D 


6522 


36 


EMC8-U 


6523 ! 



66 


QT3-F 


I 6553 


67 


QIN-R 


6554 


68 


QIN-F 


6555 


69 


AB1-F 


6556 


70 


AB2-F 


6557 


71 


AB3-F 


6558 


72 


AB1-R 


6559 
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Table 10: Features of the predicted proteins and open reading frames of the SAKS virus 





SARS ORF 


Length 


Role 


Cleavage 


Features 


Cons d 




fSEQ ID NO) 






site 




• 




P28 (9766) 


179 


Leader protein 


179 (G/G) # 




+ 




P65(9767) 


639 


Homologue of MHV p65 
cleavage product 


818 (G/A) 




+ 




Nsp1 
(9768) 


2422 ** 


Papain like protease, 
cleaves the first two 


3240 (Q/S) 


phosphoesterase domain 
Zn binding domain 


+ 


0RF1a 


Nsp2 
(9769) 


306 


proteins nsp1-nsp12 






+ 




Nsp3 (9770) 


290 


? 


3836 (Q/S) 


5 TMDs 


+ 




Nsp4 (9771) 


83 


? 


391 9 (Q/A) 


1TMD 


+ 




Nsp5 (9772) 


198 


? 


4117 (Q/N) 




+ 




Nsp6 (9773) 


113 


? 


4230 (Q/A) 




+ 




Nsp7 (9774) 


139 


? 


4369 (Q/S) 


Putative growth factor-like motif 






Nsd9 (9775* 


932 


niin l^viyil Iclaoc 






+ 




Nsp10(9776) 


601 


Putative helicase 


5899 (Q/A) 


Metal binding domain, 


+ 


.0 

T— 

Li- 










ATP/GTP bindinn domain 

* ■ I 1 # 1 ■ %JH lull iy Uul ■ lu l 1 ■ 




en 

0 


Nsp11 (9777) 


527 


? 


6426 (Q/S) 




+ 




Nsp12(9778) 


346 


? 


6772 (Q/A) 




+ 




Nsp13 (9779) 


298 


? 


- 




+ 




Spike (S) 
(6042) 




Major antigenic 
determinant, contains the 
receptor-binding domain 




Leader peptide, 1 TMD, 17 N- 
glycosylation sites 






Orf3 (6043) 

* 


274 


? 




2 TMDs, 1 N-^ycosylationsfte f -10 
O-gtyrosylatton sites 






Orf4 (6044) 


154 


? 






- 




Envelope (E) 
(6045) 


76 


Associated with viral 
envelope 




1 TMD, 2 N-gJvcosyfation sites , 


+ 


jctural regfoi 


Matrix (M) 
(6046) 


221 


Associated with viral 

pnvplniv* mpmhrano 

spanning protein 




3 TMDs, 1 N-gtycosylation site 


+ • 


Orf7 


63 


? 




1 TMD 


- 


c 


(6047) 














Orf 8 (6048) 


122 


? 




1 TMD 






Orf9 (6049) 


44 


? 




Surface-associated 






Orf 10 


39 


? 




Surface-associated 






Orfl 1(6050) 


84 


? 




1 N-gtycosylation site 






NudeocapsM (N) 
(6052) 


422 


Associated with viral genomic RMA 




phosphoprotein 


+ 




Orf13 


98 


? 




1 O-glycosylation site ! 





TMD: predicted transmembrane domain. 

Cons *: + indicates presence of corresponding protein at least in one of the other coronaviruses 
#: Alternatively, cleaved after Gly-Gly (Le. at G/A) to give a 180mer 
5 ##: This 2422mer may be further cleaved after residue 1922 (Gly-2740 of SEQ ID NO: 6039) to 
give a 1922mer PLpro containing the Zn-binding motif (SEQ K> NO: 7254) and a SOOrner. 



-395- 



WO 2004/092360 



PCT/US2004/011710 



Table 11: Protein homologies between SARS and other corona viruses 

Numbers indicate percentage of aminoacid identity between SARS proteins and corresponding 
gene products of other coronaviruses. More conserved pairs are in bold; more variable pairs are 
underlined 





group 1 


group 2 


group 3 


Proteins 


229E 


TGV 


PEDV 


MHV 


BCoV 


AIBV 


REPUCASE REGION 


leader protein pzo 




<zu 


<c\J 


4/ 


-on 
<SiU 


<dSJ 


poo nomoiogiie 


<ZSJ 


£.0 




<C\J 


on 


<C\J 


nspi \rUr protease/ 


OR R 


OR A 


OR A 


QQ 


ou 


OR 


nsp2 (3CL protease) 


40.4 


43.8 


44.6 


50 


48.4 


41 


nsp3' 


30 


27 


29.4 


34.2 


35.5 


28.5 


nsp4 


38.6 . 


4a2 


39.8. 


47.5 


46.1 


37.3 


nsp5 


46\2 


42.9 . 


43.9 


46.8 


47.3 


38.7 


nsp6 


45.1 


38.9 


45.1 


45.1 


46.9 


39.8 


nsp7 


53.8 


54.5 


56.1 


56.2 


55.4 


58.3 


nsp9 (polymerase) 


59.8 


59.6 


60 


67.3 


66.9 


62.4 


nsplO(hellcase) 


60.7 


62 


62.3 


67.2 


68.6 


58.9 


nsp11 


52.3 


53.7 


52.3 


57.6 


57.6 


52 


nsp12 


43.1 


43 


45.4 


45.9 


45 


40.2 I 


nsp13 


56.4 


54.4 


55,3 


63 


65 


53.4 


STRUCTURAL REGION 


Spike (S) 


2M 


31.6* 


30.3 


31.1 


31 


32.7* 


Envelope (E) 


33* 


27.9 


20 


23 


26.5 


23.2 


Matrix glycoprotein (M) 


30.6 


32.5 


34.8 


40.8 


41.9 


32.5 


Nucleocapsid (N) 


26.9 


30.1 


29.5 


37.3 


37.4 


31.5 



* These three alignments were obtained only on a fragment of the whole protein. 
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Table 12: Nucleotide and aminoadd differences between five S ARS isolates 







FRA* 


TOR2* 


Urtoani* 


CUHK* 


HKU* 


position 0 


base/aminoacid 


base/aminoacid 


base/aminoacid 


base/aminoacid 


base/aminoacid 
















ORFIa 


2557 


A/Thr 


G/Ala 


G/Ala 


ft/Ala 


ft/Ala 


2601 


T/Val 










7746 


G/Pro 






1 




7919 


C/Ala 




f / VGU 






7930 


G/Asp 








A/Asn 


8387 


G/Ser 








C/Thr 


8416 


G/Arg 








C/TTir 


9404 


T/Val 






C/Ala 




9479 


T/Val 






C/Ala 




11448 


T/lle 


C 


c 


C 


C 
















ORFIb 


13494 


GT/Val 








AG/Ser 


16622 


C/Ala 




T 






17564 


T/Asp 






C/Glu 




17846 


C/Arg 






T 




18065 


G/Lys 








A 


18965 


A/lle 


T 


T 


T ! 


T 


19064 


A/Glu 






G 




19084 


T/lle 


C/Thr 


C/Thr 


w/ 1 Ml 


w/ 1 nr 
















spike 


.21721 


G/Gly 






A/Asp 






T/lle 






C/Thr 




23220 


T/Ser 


G/Ala 








24872 


T/Leu 




c 






24933 


T/Phe 


C/Leu 


C/Leu 


C/Leu 


C/Leu 
















ORF3 


25298 


G/Gly 


A/Arg 










OCCCQ 


1/ mot 








A/Lys 
















matrix 


26600 


T/VaJ 


C/Ala 


C/Ala 


C/Ala 






26857 


T/Ser 




C/Pro 




















| ORF10 


27827 


T/Cys 






C/Arg 


















nucleocapsid 


28268 


T/lle 


emir 


C/Thr 


C/TTir 


C/Thr 



* SARS corona virus ERA (accession number AY3 10120) 
SARS coronavirus TOR2 (accession.number AY274119) 
5 SARS coronavirus Urbani (accession number AY278741) 

SARS coronavirus CUHK-W1 (accession number AY278554) 
SARS coronavirus HKU-39849 (accession number AY278491) 

° Ibe position is based on the FRA sequence. 

0 
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TABLES 13-25: T-epitope predictions for SEQ ID NOS: 6039-6050 & 6052 

Epitope predictions were performed at http://www.mpiib-berlm,mpg.deMAPPP^ using a 

miTiimnm score of 0.5 and the BIMAS matrix, with a maximum of 20 results being selected. Hie 
analysis revealed 9mer and lOmer epitopes. 



Table 13: Epitopes for SEQ ID NO: 6039 



HLA Al - 9 mers 


Maximum possible score using this molecule type 


5625 


I Rank || Start position 


Sequence 


% of max. score 


Score 


I 1 II 1867 


SEO ID NO: 7400 


8 % 


450 


1 2 || 4139 


SEO ID NO: 7401 


5.55 % ! 


312.5 


| 3 || 88 


SEO ID NO: 7402 


4 % | 


225 


I 4 || 4249 


SEQ ID NO: 7403 


3.55 % 


200 


I 5 || 4059 I SEQ ID NO: 7404 


2.22 % 


125 


1 6 || 2027 . 


SEQ ID NO: 7405 


1.6 % ' 


90 


I 7 II 3413 


SEQ ID NO: 7406 


1.11 % 


62.5 


I 8 H 1823 


SEQ ID NO: 7407 


0.88 % 


50 


! 9 2798 


SEQ ID NO: 7408 


0.88 % 


50 


1 10 H 220 


SEQ ID NO: 7409 


0.8 % 


45 


I 11 II 3738 


j SEQ ID NO: 7410 


0.8 % 


45 


I 12 H 4182 


SEQ ID NO: 7411 


0.8 % 


45 


1 13 || 4174 


SEQ ID NO: 7412 


0.66 % 


37.5 


I 14 1940 


SEQ ID NO: 7413 


0.55 % 


31.25 


1 15 II 38 


SEQ ID NO: 7414 


0.48 % 


27 


1 16 II 1231 


SEQ ID NO: 7415 


0.44 % 


25 


S 17 || 1613 


SEQ ID NO: 7416 


0.44 % 


25 


S 18 II 3645 


SEQ ID NO: 7417 


0.44 % 


25 


1 19 || 4192 


SEQ ID NO: 7418 


0.44 % 


25 


j 20 II 378 


SEQ ID NO: 7419 


0.4 % 


22.5 




HLA Al - 10 mers 


Maximum possible score using this molecule type 


5625 1 


Rank 


Start position 


Sequence 


| % of max. score 


Score | 


1 


1867 


SEQ ID NO: 7420 


8% 


450 | 


2 


1495 


SEQ ID NO: 7421 


4% 


225 | 


3 


3921 


SEQ ID NO: 7422 


2.4 % 


135 


4 


486 


SEQ ID NO: 7423 


2.22 % 


125 | 


5 


4139 


SEQ ID NO: 7424 


2.22 % 


125 | 
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6 


62 


SEQ ID NO: 


: 7425 


1.6 % 


90 


7 


1190 


SEQ ID NO 


: 7426 


1.6 % 


90 


8 


1284 


SEQ ID NO: 


: 7427 


1.6 % 


L 9° 


9 


3284 


SEQJD NO: 


: 7428 


1.6 % 


90 


10 


2921 


SEQ ID NO: 


: 7429 


1.2 % 


67.5 


11 


! 349 


| SEQJD NO 


: 7430 


0.8 % 


45 


12 


789 


| SEQ ID NO 


: 7431 


0.8 % 


45 


13 


1185 


1 SEO ID NO: 


: 7432 


0.8 % 


45 


14 


4184 


1 SEO ID NO: 

1 m^ mm ^£ A mm 1 V t 


: 7433 


0.8 % 


45 


15 


1313 


I SEO ID NO: 


: 7434 


0.64 % 


36 


16 


3948 


SEQ ID NO 


: 7435 


0.48 % 


27 


17 


I 149 


SEQ ID NO 


: 7436 


0.44 % 


25 


18 


941 


SEQ ID NO 


: 7437 


0.44 % 


25 


19 


1390 


SEQ ID NO 


: 7438 


0.44 % 


25 


20 


1613 


SEQ ID NO 


: 7439 


0.44 % 


25 



Maximum possible score using this molecule type 


I 12150 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


1010 


SEQ ID NO: 


7440 


1.48 % 


180 


2 1 3155 


SEQ ID NO: 


7441 


1.48 % 


180 


3 1229 


SEQ ID NO: 


7442 


1.23 % 


150 


4 | 2405 


SEQ ID NO: 


7443 


0.88 % 


108 


5 II 2 


SEQ ID NO: 


7444 


0.74 % 


90 


6 II 2304 


SEQ ID NO: 


7445 


0.74 % 


90 


7 i 2358 


SEQ ID NO: 


7446 


0.74 % 


90 


8 


1 3160 


SEQ ID NO: 


. 7447 


0.74 % 


90 


9 3771 


SEQ ID NO: 


: 7448 I 


0.74 % 


90 


10 || 4007 


SEQ ID NO 


: 7449 I 


0.74 % || 90 


11 » 3079 ' 


SEQ ID NO: 


: 7450 8 0.66 % 


81 


12 | 4045 


SEQ ID NO: 


: 7451 II 0.66 % 


81 


I 13 || 1081 


SEQ ID NO: 


: 7452 H 0.49 % 


60 


14 || 3268 


SEQ ID NO' 


: 7453 1! 0.49 % 


60 


15 1 4144 


SEQ ID NO 


: 7454 || 0.49 % 


60 


16 || 614 


SEQ ID NO 


: 7455 II 0.37 % 


45 


17 II 728 


SEQ ID NO 


: 7456 || 0.37 % 


45 


18 || 1537 


SEQ ID NO 


: 7457 


! 0.37 % 


45 


19 I 313 


SEQ ID NO 


: 7458 II 0.32 % 


40 


20 || 1744 


I SEQ ID NO 


: 7459 | 0.32 % 


40 
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HLA A3 - 10 mers 1 



Maximum possible score using this molecule type 


1 o i en 

IlIJU i 


Rank 


Start oosition 


Sequence 


% of max. score 


9v>urc i 


1 1 


62 


SEQ ID NO: 7460 


4.44 % 


•>d.n 1 

jtu | 


2 [ 


2151 


SEQ ID NO: 7461 


2.46 % 


Tnn 1 


3 1 


633 


SEQ ID NO: 7462 


2.22 % 


"57n 1 
z/u I 


4 1 


1 1 5R 


SEQ ID NO: 7463 2.22 % 




q I 


2565 i 


SEQ ID NO: 7464 | 2.22 % 


•>7n 1 
Z/U 1 


o i 


29QR 


SEQ ID NO: 7465 I 


1.77 % 


ZJLD | 


7 i 




SEQ ID NO: 7466 I 


1.11 % 


1 JD | 


R 1 

o i 




SEQ ID NO: 7467 | 


0.98 % 


1 "5f» I 

1ZU I 


Q 1 


21 Rfi ! 


SEQ ID NO: 7468 | 


0.74 % 


on I 


1 n I 


3869 


SEQ ID NO: 7469 


0.74 % 


on I 
yu | 


1 1 I 


2308 


SEQ ID NO: 7470 | 


0.66 % 


OX J 


12 | 


786 


SEQ ID NO: 7471 


0.55 % 


67.5 | 


13 | 


749 


SEQ ID NO: 7472 


0.49 % 


60 | 


14 | 


' 1080 


SEQ ID NO: 7473 


0.49 % 


60 | 


15 I 


! 2358 


SEQ ID NO: 7474 


0.49 % 


60 


16 | 


. 3955 


SEQ ID NO: 7475 


0.49 % 


60 


17 | 


714 


SEQ ID NO: 7476 


0.37 % 


45 


18 


1081 


SEQ ID NO: 7477 j| 0.37 % || 45 


19 | 1170 


SEQ ID NO: 7478 


0.37% ; || 45 


20 B 1228 


SEQ ID NO: 7479 


0.37% || 45 







HLA A24 - 9 mers 




Maximum possible score using this molecule type 


1596.672 


Rank 


Start position 


|| Sequence 


% of max. score 


Score 


1 


3797 


H SEQ ID NO: 7480 I 


37.57 % I 


600 


2 


4202 


|| SEQ ID NO: 7481 


37.57 % \ 


600 ! 


3 


3189 


1 SEQ ID NO: 7482 I 


25.05 % ! 


400 


4 


1864 


H SEQ ID NO: 7483 I 


I 23.14 % | 


I 369.6 


5 


1066 


|| SEQ ID NO: 7484 | 


1 22.54 % 


S 360 


6 


2143 


j SEQ ID NO: 7485 ! 


22.54 % 


! 360 


7 


2693 


II SEQ ID NO: 7486 | 


22.54 % || 360 


8 


1426 


j SEQ ID NO: 7487 | 


18.78 % 


I 300 


9 


1238 


|| SEQ ID NO: 7488 | 


18.03 % 


! 288 


10 


3768 


|| SEQ ID NO: 7489 | 


I 18.03 % 


! 288 


11 


797 


1 SEQ ID NO: 7490 


I 15.03 % 


! 240 
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12 | 


1882 


I SEQ ID NO: 7491 


15.03 % 


240 


13 I 


1490 


I SEQ ID NO: 7492 


13.77 % 


220 


14 || 


2237 | 


I SEQ ID NO: 7493 


13.77 % 


220 


15 | 


95 I 


I SEQ ID NO: 7494 


12.52 % 


200 


16 | 


1821 


I SEQ ID NO: 7495 


12.52 % 


200 


17 r 


2289 | 


1 SEQ ID NO: 7496 


12.52 % 


200 


18 II 


3080 || SEQ ID NO: 7497 


12.52 % 


200 


19 I 


3660 i 


SEQ ID NO: 7498 


12.52 % 


200 


20 | 


4354 I 


SEQ ID NO: 7499 1 


12.52 % 


200 



HLA A24 - 10 mers 

Maximum possible score using this molecule type tf"l596.672 

Rank II Start position | Sequence II % of max. score II ScortT^ 

1 j 2143 II SEQ ID NO: 7500 fl 37.87% II 604.8 

2 II 1159 II SEQ ID NO: 7501 [ 26.30% H 420 

3 I .1650 lf~SEQ ID NO: 7502 || 26.30% II 420 

4 || 1150 jfSEQ ID NO: 7503 j 18.78% II 300 

5 j 2763 I SEQ ID NO: 7504 j 18.78% I 300 

I I 3165 II SEQ ID NO: 7505 II 18.78% II 300 

7 || . 3201 II SEQ ID NO: 7506 II 15.03% II 240 

8 1 ,3694 II SEQ ID NO: 7507 11 15.03% j. 240 

9 II 4204 II SEQ ID NO: 7508 II 15.03 % j 240 " 

10 II 1692 II SEQ ID NO: 7509 II 13.77% II. 220 I 

II j 797 II SEQ ID NO: 7510 || 12.52% j 200 

12 j 1610 II SEQ ID NO: 7511 j 12.52% 1 200 

13 1789 ll SEQ ID NO: 7512 II 12.52 % j 200 

14 [ 1881 II SEQ ID NO: 7513 1 12.52 % II 200 

15 II 3090 v j SEQ ID NO: 7514 || 12.52 % IT 200 

16 II 3763 j SEQ ID NO: 7515 II 12.52 % j 200 

17 j 2569 II SEP ID NO: 7516 II 11.27% || 180 

18 II 194 II SEQ ID NO: 7517 I 9.39 % j 150 

19 II 1771 II SEQ ID NO: 7518 || 9.39 % || 150 I 

20 II 2488 II SEQ ID NO: 7519 H 9.39 % H 150 I 



HLA A 0201 - 9 mers 


Maximum possible score using this molecule type 


3925227.1 


Rankll Start position H Sequence I % of max. score 


Score | 


1 II 2308 | SEQ ID NO: 7520 II 0.20 % 


8144.13515256 


2 II 3729 || SEQ ID NO: 7521 H 0.10 % 


4047.23088 



-401- 

I 
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3 i 3574 USEQIDNC 


l: 7522 || 0.09 % || 3547.4996614 


4 | 3615 HSEQIDNO 


C 7523U 0.06% H 2722.682592 


5 II 3159 HSEQIDNO 


»: 7524 | 0.05% || 1999.734264 


6 II 2339 H SEQ ID NO 


»: 7525 1 0.03 % H 1551.92907744 


7 || 2201 HSEQIDNO 


: 752611 0.03 % II 1521.53694 


8 | 3559 I SEQ ID NO 


:7527fl 0.02% fl 1174.38939504 


9 II 3085 HSEQIDNO 


: 752811 0.02 % II 1146.296448 


10 || 4070 || SEQ ID NO 


: 7529 H 0.02 % 


970.4103696 


11 H 3708 | SEQ ID NO 


: 7530 


0.02 % 


958.92888 


12 || 3098 II SEQ ID NO 


: 7531 


0.02 % 


942.678 1 


13 I 1362 8 SEQ ID NO 


: 7532 


0.02 % 


900.6984 


14 8 3563 H SEO ID NO 


: 7533 


0.01 % 


735.86016 


15 II 3774 H SEO ID NO 


: 7534 


I 0.01 % 


687.655656 


16 H 4242 II SEO ID NO 


: 7535 


0.01 % 


685.78272 . 


1 17 H 2340 HSEQIDNO 


: 7536 


0.01 % 


668.37342936 


I 18 


1 650 || SEQ ID NO 


: 7537 


0.01 % 


640.1983392 


1 19 || 3862 H SEQ ID NO 


: 7538 


II 0.01 % 1 


620.57772 


1 20 M 2860 HSEQIDNO 


: 7539 


H o.oi % 


607,88448 




HLA A 0201 - 10 mers ~~1 


Maxim 


urn possible score using this molecule type || 3925227.1- | 


Rank 


Start position 


Sequence 


% of max. score I Score I 


1 


2307 


ISEQ ID NO: 


7540 


0.40 % H 15915.66281448 


2 


2201 


ISEQ ID NO: 


7541 


0.12 % | 4772.09313 


3 


3558 HSEQIDNO: 


7542 


0.05 % H 2295.04855632 




1772 H SEQ ID NO: 


7543 


0.04 % H 1759.6656 


5 || 3087 HSEQIDNO: 


7544 


0.03% | 1215.76896 


6 H 2339 I 


ISEQ ID NO: 


7545 


0.02 % H 1116.29986272 


7 H 2308 


ISEQ ID NO: 


75461 


0.02% || 970.14776112 


8 II 3061 


ISEQ ID NO: 


75471 


0.02% H 836.2525104 


9 H 2748 HSEQIDNO: 


75481 


0.01 % H 726.706344 


10 || 3837 HSEQIDNO: 


75491 


0.01 % I 720.8292 


1 11 II 59 HSEQIDNO: 


7550|| 


0.01 % | 650.3112 


1 12 fl 2877 HSEQIDNO: 


7551 1 


0.01% | 620.22996 


13 | 4114 HSEQIDNO: 


7552|| 


0.01 % | 559.8936 


14 H 805 HSEQIDNO: 


7553| 


0.01 % | 484.4565072 


15 II 1655 HSEQIDNO: 


7554 [ 


0.01 % U 437.48208 


16 || 611 HSEQIDNO: 


75551 


0.00 % f 319.9392 


17 || 1961 IISEOIDNO: 


75561 


0.00% H 305.94186 
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18 


1223 


SEQ ID NO: 7557 


0.00 % 


II 289.08792 


19 


852 


SEQ ID NO: 7558 


0.00 % 


11 285.67242 


20 


2139 


SEQ ID NO: 7559 


0.00 % 


II 284.845869 



HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 


| Jo 


Rank 


Start position ||_ Sequence 


% of max. score 


Score 


l 


4200 || SEQ ID NO: 7560 


t>u % 


18 


2 


Zol 


ccn Trv mo- tct^ 1 
bfcO ID IVIU: 75ol 


1 25 % 


[ 9 




OTIC 

3236 


bEQ ID NO: 7562 


ICO/ 

25 vo 


9 


A 

4 


509 


SEQ ID NO: 7563 


16.66 % 


6 


5 


o4o 


oty ID IMU. /504 


16.66 % 


6 


6 


T 1 no 
2193 


bfcQ ID NO: 7565 


AC ft/ 

16.66 % 




-7 
/ 


J54Z 


CCO TO MO- 

bbv2 ID NO. /5ob 


i C CC O/ 


6 


o 
o 


541 


SEO ID NO' 7567 


-f r* n/ 

15 % 


L 5.4 


y 


1748 


SEQ ID NO: 7568 


i C 0/ 

12.5 % 


4.5 


10 


829 


SEQ ID NO: 7569 


1 1 1 1 % 

Hill /o 


4 


11 


1149 


SEQ ID NO: 7570 


ii.il % ■ 


4 


12 


2027 


SEQ ID NO: 7571 


ii.il % 


4 


1 13 


2576 


SEQ ID NO: 7572 


li.ii % 


4 


1 14 


873 


SEQ ID NO: 7573 


8.33'% 


3 


15 


2725 


SEQ ID NO: 7574 


! 8.33 % 


3 


16 


3541 . 


SEQ ID NO: 7575 


8.33 % 


3 


17 


. 1837 


SEQ ID NO: 7576 


7.5 % 


2.7 


18 


2475 


SEQ ID NO: 7577 


7.5% 


2.7 


19 


2703 


SEQ ID NO: 7578 


7.5% 


2.7 


20 


1823 


SEQ ID NO: 7579 


6.66 % 


2.4 



HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


3541 


SEQ ID NO: 7580 


50 % 


18 


2 


281 


SEQ ID NO: 7581 


25 % 


9 


3 


1495 


SEQ ID NO: 7582 


25 % 


9 


4 


2303 


SEQ ID NO: 7583 


25 % 


9 


5 


2616 


SEQ ID NO: 7584 


25 % 


9 


6 


48 


SEQ ID NO: 7585 


16.66 % 


6 


7 


1394 


SEQ ID NO: 7586 


16.66 % 


6 


8 


1499 


SEQ ID NO: 7587 


1 16.66 % 


6 
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g 


1862 | 


SEO ID NO: 7588 


16 66 % 


6 J 


10 


1163 


SEO ID NO: 7589 | 


11 11 % 

JL. JL • Jk JL # w 


4 


i 11 


Anna 
4005 


II SEQ ID NO: 7590 I 


11 11 % 

JL JL • JL JL /V 


4 J 


1 12 


An A A 

4344 


1 


SEO ID NO: 7591 I 


11 11 % 


4 ! 


13 
1J 


633 


1 


SEO ID NO: 7592 | 


10 % 

JL w /U 


3 6 


14 


119 


t 


SEO ID NO: 7593 I 


KJm *J*J /U 


3 


15 


1190 


1 


SEO ID NO: 7594 I 


ft 33 % 

O.JJ /o 


-j 


Ifi 
lb 


1195 


1 


SEQ ID NO: 7595 I 


8.33 % 


3 


17 


1725 


II SEO ID NO: 7596 [ 


8.33 % 


3 


.18 


2728 


II SEQ ID NO: 7597 I 


8.33 % 


3 


19 


2895 


|| SEQ ID NO: 7598 | 


8.33 % 


3 


20 1 


! 3033 


I 


SEQ ID NO: 7599 II 8.33 % 


3 




HLA B7 - 9 mere 


Maximum possible score using this molecule type 


I 5400 


Rank | 


Start position 


Sequence 


% of max. score 


LScore 


l 




ccn m mo- 7cnn 
ocy wj imu. /duu 


4.44 % 


I 240 


2 


2580 


SEQ ID NO: 7601 


4.44 % 


I 240 


3 I 


1703 


SEQ ID NO: 7602 


3.70 % 


I 200 


■ 4 | 


113 


SEQ ID NO: 7603 


2.22 % 


| 120 


1 5 | 


168 


SEQ ID NO: 7604 


2.22 % 


I 120 


1 6 | 


2842 


SEQ ID NO: 7605 


2.22 % 


I 120 


7 


4027 


SEQ ID NO: 7606 


2.22 % 


I 120 


8 


3680 


SEQ ID NO: 7607 


1.66 % 


| 90 


9 


2085 


SEQ ID NO: 7608 


1.48 % 


1 80 


10 


2492 


SEQ ID NO: 7609 


1.48 % 


I 80 


11 


2660 


SEO ID NO: 7610 


1.48 % II 80 


12 


2906 


SEQ ID NO: 7611 


1.48 % 


1 80 


13 


3346 


SEQ ID NO: 7612 


1.48 % 


I 80 


14 


4038 


SEQ ID NO: 7613 


1.48 % 


| 80 


15 


1163 


1 SEQ ID NO: 7614 


1.11 % 


| 60 


16 


1457 


SEQ ID NO: 7615 


1.11 % 


1 60 


17 


2351 


SEQ ID NO: 7616 


1.11 % 


1 60 


18 


2471 


SEQ ID NO: 7617 


1.11 % 


1 60 


19 


3499 


SEO ID NO: 7618 


1.11 % 


I 60 


20 


3635 


SEQ ID NO: 7619 


1.11 % 


! 60 



| HLA B7 - 10 mere 

Maximum possible score using this molecule type II 5400 
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Rank 


Start Dosition 


1 ^pniipnre 


o/a of may. score 


Score 


1 

X 


1703 


1 SEO ID NO* 7620 


3 70 % 


200 


2 


\ 17 


1 SEO ID NO" 7621 


2 22 % 


120 


3 


3008 


1 SEO ID NO' 7622 


2 22 % 


120 


4 


4106 

v X Uw 


1 SEO ID NO* 7623 


2 22 % 


120 


K 

•j 


3450 


\ SEO ID NO* 7624 


1 66 % 

X • vU /U 


90 


fi 
%j 


113 


SEO ID NO' 7625 


1 48 % 


80 


7 


X -v J 


1 SFO ID NO' 7626 


1 48 % 
x • to /o 


80 


Q 

o 


307 

JU/ 


1 ^FO TD NO* 7677 


1 48 % 


80 


Q 


780 

/ OU 


1 9FO TD NO' 7678 

1 Juy XL/ liVJ ■ / UZ.O 


1 48 % 


80 

OU 


1 o 

XU - 


1 ooo 


1 Qcn m MO- 7fi7Q 


1 48 0/ n 
X-*tO /0 


1 80 
| ou 


11 


1072 

I xU/ £. 


\ ^FO ID NO" 7630 

1 Jty XL/ IMU. / \J*J\J 


1 48 % 


80 

! OU 


17 


1404 


^FO TD NO" 7631 

JLU XL/ INU* /U*/A 


1 48 % 


80 
ou 




1Q80 


^FO TD NO' 7637 


1 48 % 


i 80 

l OU 


14 


77fi7 


^FO TD NO" 7633 

OtU XL/ INV/- /xjOO 


1 48 % 

X itO /O 


- 80 


1 q 
J. */ 


2543 


SFO TD NO' 7634 

»_?L.V£ XL/ IM v/ • / UJ7 


1.48% 


80 


16 


I 2906 


SEQ ID NO: 7635 


1.48% 


80 


17 


3077 


SEQ ID NO: 7636 


1.48 % 


80 


18 


3175 


SEQ ID NO: 7637 


1.48 % 


. 80 


19 


4195 


SEQ ID NO: 7638 


1.48% || 80 


20 


4251 


SEQ ID NO: 7639 


1.48 % . | 


1 80 



Table 14: Epitopes for SEQ ID NO: 6040 



HLA Al - 9 mers 


Maximum possible score using this molecule type : | 


I 5625 


Rank I 


1 Start position 


Sequence 


% of max. score 


Score 


1 


20 


SEQ ID NO: 7640 


0.04 % 


2.25 


2 


91 


SEQ ID NO: 7641 


0.01 % 


1 


3 


125 


SEQ ID NO: 7642 


0.01 % 


0.75 


4 


56 


SEQ ID NO: 7643 


0.00 % 


0.5 


.. 5 


145 


SEQ ID NO: 7644 


0.00 % 


0.5 



HLA Al - 10 mers 



Maximum possible score using this molecule type 


5625 


Rank 


| Start position 


Sequence 


% of max. score 


Score 


1 


| 20 


SEQ ID NO: 7645 


0.01 % 


0.9 


2 


I 56 


SEQ ID NO: 7646 


0.00 % 


0.5 


3 


1 71 


SEQ ID NO: 7647 


0.00 % 


0.5 
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I 4 || "IS 



I SEQ ID NO: 7648 fl 



PCT/DS2004/011710 
0.00% || Q.5 II 



HLA A3 - 9 mers ~| 


Maximum possible score using this molecule type 




Dank II 


Start position 


Sequence 


1 % of max. score 


I score 


1 II 
1 II 


115 


SEQ ID NO: 7649 


i 0.24 % 


i on 

I 30 


1 7 II 

1 2 H 


87 


SEQ ID NO: 7650 


^====================H 

\ 0.04 % 


I 6 


1 J H 


80 || SEQ ID NO: 7651 


[ 0.03 % 


4.05 


4 L 


125 


1 SEQ ID NO: 7652 


I ' 0 01 % 


1.8 


5 I 


39 


I SEQ ID NO: 7653 


0.01 % 


1 5 


1 6 i 


56 


SEQ ID NO: 7654 


! 0.01 % 


1.5 


7 1 


135 


SEQ ID NO: 7655 


0.00 % 


1.2 


8 1 


91 


SEQ ID NO: 7656 


r ~~ d.oo % 


1 


9 1 


119 


SEQ ID NO: 7657 


0.00 % 


1 


10 | 


141 


SEQ ID NO: 7658 


0.00 % 


0.9 


11 1 


150 


SEQ ID NO: 7659 I 


0.00% i 


0.6 


12 1 


137 


SEQ ID NO: 7660 I 


0.00 % 


0.54 



HLA A3 - 10 mers 


I Maximum possible score using this molecule type 


1 


1 12150 


1 Rank 


Start position 


! Sequence 


% of max. score 




| Score 


1 


I 36 


II SEQ ID NO: 7661 


0.24 % 




I 30 


2 


1 144 


II SEQ ID NO: 7662 


0.06 % 1 


1 8 


3 


! 101 


|| SEQ ID NO: 7663 


0.03 % 1 


1 4 


4 


i 99 


fl SEQ ID NO: 7664 


0.02 % 




1 3.6 


5 


80 


|| SEQ ID NO: 7665 


0.02 % 


1 


1 2.7 


6 


125 


fl SEQ ID NO: 7666 


0.01 % 


1 


I 1.6875 


7 


71 


II SEQ ID NO: 7667 


0.01 % 


1 


I 1.5 


8 


118 


II SEQ ID NO: 7668 


I 0.01 % 


1 


! 1.5 


9 


40 


fl SEQ ID NO: 7669 


0.01 % 


i 1.35 


10 


5 


fl SEQ ID NO: 7670 


0.00 % 


0.9 


11 


56 


II SEQ ID NO: 7671 


0.00 % 


1 


0.9 


12 


107 


II SEQ ID NO: 7672 


0.00 % 


1 


0.6 


13 


135 


II SEQ ID NO: 7673 


0.00 % 1 


0.6 


14 


141 


II SEQ ID NO: 7674 


0.00 % 


.1 


0.6 


15 


148 


II SEQ ID NO: 7675 


0.00 % 1 


0.6 


16 


116 


II SEQ ID NO: 7676 


0.00 % 


t 


0.5 



HLA A24 - 9 mers 



^06- 
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iMaximum possible score using this molecule type 


|| 1596.672 | 




blaiT pOSluon 


c An i ic*nf£* 1 


| /O Ul II Id A.. 


1 Score 


1 




ccn TD MO- 7fi77 

DCy A!-' IMVJt f\J/ f 


1 n c % 

x.Uj /o 


II 16.8 


z 


on 


QFO TH MO* 7fi7ft 


n 7^ oa 

U./3 /0 


II 12 


3 




cpn TH MH* 7fi7Q 


n c n oa 
u.du /0 


II 8 


4 


13/ 


CCO TH MO- 7fiQH 
btlj 1U IMU. /DoU 


U.OU VO 


II 8 

|l w 


5 


y 


CCO Th MO- 

oty ID IMU. /Dol 


U.«+D vo 


II 7.2 


6 


77 


prn TO MO« 7CQO 


n /i e o/ 


II 7.2 


7 


llz 


CCO TO MO» ~7CQ*2 

bb(J ID IMU. /OoJ 


n AC OA 
U.^rD V0 


II 7 2 1 


8 


73 


CeO TO MO« 7CO/1 

bbvj 1L> IMU. /Do4 


U.*H vo 


in 6 6 i 


9 


32 


CEO Tr\ MO- "7COC 

SEQ ID IMU: /bob 


n q*7 o/ 
U.o/ vo 


II 6 1 




110 


Tl""\ MA. ~7 CO C 

SEQ ID NO: 7686 


0.37 % 


1 6 


11 


140 


CCO Tr\ MO* 7CQ7 

bkvj AD IMU: /bo/ 


n "3*7 OA 

U.o/ vo 


1 1 

II 0 


12 


143 


CCO Tr\ MO* 7CQQ 

bcQ Id NU. /boo 


A Q7 OA 

U.J/ vo 


II 6 


13 


lo 


CCO TO MO* 7CQO 

bty ID NU. /boy 


n *3n oa 
U.JU vo 


II 4 8 


1 4 


54 


ceo Tr> mo- 7Con 
btlj ID NU. /byu 


n oa 
U.JU vo 


Il 4*8 


15 


108 


SEO ID NO" 7691 


0 30 % 


Il 4-8 


16 


141 


SEQ ID NO: 7692 


0.30 % 


II 4.8 


17 


92 


SEO ID NO: 7693 


0.27 % 


II 4.4 


18 


33 


SEQ ID NO: 7694 


0.25 % 


II 4 


19 


49 


SEQ ID NO: 7695 


0.25 % 


II 4 


20 


111 


SEQ ID NO: 7696 


0.25 % 


II 4 



HLA A24 - 10 mers 


Maximum possible score using this molecule type 


II 1596.672 


Rank 


Start position 


Sequence 


% of max. score 


j| Score 


1 


142 


SEO ID NO: 7697 


12.52 % 


II 200 


2 


110 I 


SEQ ID NO: 7698 


0.75 % 


HH 


3 


99 | 


SEQ ID NO: 7699 


0.50 % 




4 


8 1 


SEQ ID NO: 7700 


0.45 % 


1 7.2 


5 


140 1 


SEQ ID NO: 7701 


0.45 % 


1 7.2 


6 


32 


SEQ ID NO: 7702 | 


I 0.37 % 


II 6 


7 


17 1 


I SEQ ID NO: 7703 


I 0.30 % 


II 4.8 ! 


8 


53 i 


I SEQ ID NO: 7704 


0.30 % 


II 4.8 


9 


76 


1 SEO ID NO: 7705 


0.30 % 


II 4.8 


10 


107 


1 SEQ ID NO: 7706 


0.30 % 


II 4.8 


11 


111 1 


S SEQ ID NO: 7707 


0.30 % 


II 4.8 


12 


72 1 


1 SEO ID NO: 7708 


0.27 % 


II 4.4 


13 


91 || SEQ ID NO: 7709 


0.27 % 


II 4.4 



-4V7- 
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14 


| 31 1) SEQ ID NO: 7710 


0.25 % 


4 


15 


1 127 


SEO ID NO* 7711 


0.25 % 


4 


16 


139 


SEQ ID NO: 7712 


0.25 % 


4 


17 


80 


SEQ ID NO: 7713 


[ 0.22 % 


3.6 


18 


38 


SEQ ID NO: 7714 


0.18 % 


3 


19 


118 


SEQ ID NO: 7715 


! 0.18 % 


3 


20 


49 


SEQ ID* NO: 7716 


0.12 % 


2 




HLA A 0201 - 9 mers 


Maximum possible score using this molecule type 


3925227.1 


Rank) 


Start position | 


Sequence 


I % of max. score 


| Score 


1 | 


80 


SEQ ID NO: 7717 


0.00 % 


171.96732 


2 I 


147 


SEQ ID NO: 7718 


0.00 % 


51.46848 


3 [ 


143 


SEQ ID NO: 7719 


0,00 % 


11.6146182 


A 1 

4 1 


56 


SEQ ID NO: 7720 


0.00 % 


11.304684 


5 f 


10 


SEQ ID NO: 7721 


0.00 % 


1 a Oyl roc 


6 I 


6 


bcQID NO: 7/ZZ 


r\ r\r\ a/ 


6.56830734 


7 I 


26 


SEQ ID NO: 7723 


0.00 % 


O.U/014 


8 I 


141 


SEQ ID NO: 7724 


0.00 % 


5.981472 


9 I 


148 


SEQ 1 ID NO: 7725 


0.00 % 


5.194044 


10 


9 


SEQ ID NO: 7726 


0.00 % 


4.299183 


11 1 


137 


SEQ ID NO: 7727 


0.00 % 


4.299183 


12 | 


130 


SEQ ID NO: 7728 


0.00 % 


4.138344 


13 I 


84 


SEQ ID NO: 7729 


0.00 % 


3.42792 


14 | 


27 


SEQ ID NO: 7730 


0.00 % 


3.383484 


15 


2 


SEQ ID NO: 7731 


0.00 % 


3.381 


16 I 


62 


SEQ ID NO: 7732 


0.00 % 


3.251556 


17 | 


23 


SEQ ID NO: 7733 


0.00 % 


2.9542005 


18 | 


99 


SEQ ID NO: 7734 


0.00 % 


1,982232 


19 


! 33 


SEQ ID NO: 7735 


0.00 % 


1.86921 


20 | 


111 


SEQ ID NO: 7736 


0.00 % 


1.76402985 




HLA A 0201 - 10 mers 


Maximum possible score using this molecule type 


3925227.1 




Start position 


Sequence 


% of max. score 


Score 


1 


5 


SEQ ID NO: 7737 


0.00 % 


159.9696 


2 


25 


SEQ ID NO: 7738 


0.00 % 


69.552 


3 


80 


SEQ ID NO: 7739 


0.00 % 


36.5148 


4 


107 


SEQ ID NO: 7740 


0.00 % 


21.3624 
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5 


148 1 SEQ ID NO: 7741 I 


0.00 % | 


17.73576 


\j 


61 I SEQ ID NO: 7742 I 


0.00 % 1 


13 9104 


7 


•i A "7 

147 


oty ID IMU- / /HO 1 


0.00 % 


11.304684 


Q 
o 


rn 
DO 


oty 1LJ NU. / /HH 1 


0.00 % 1 


8.230458 


Q 


17 


CCn TH MO- ~7~7AtZ 1 

bfcQ ID NU. //4i> [ 


0.00 % 


7 3086111 

/ ■ ^\J\J\J XXX 


10 


110 


ceo m mo. iiacl 1 
bbQ ID NU: //4o 1 


0.00 % 1 


6.174104475 


1 1 

X JL 


9 


SEQ ID NO: 7747 | 


0 00 % 1 

W • W \J 1X1 | 


6 0858 

V/a V/U«^U 


i 1 1 

x#C 


99 


SEQ ID NO: 7748 | 


o oo % n 


5 6823984 


13 


2 


SEQ ID NO: 7749 | 


0 00 % 


3 1RR2R3 

•J a 10040J 


14 I 


HI 


QFO Tn MO- 77^0 1 


0 00 % 1 




1 R I 


135 


SEQ ID NO: 7751 | 


0 00 % 
UiUU /o 




16 | 


76 


SEQ ID NO: 7752 


0.00 % 


2.005692 


17 1 


23 


SEQ ID NO: 7753 


0.00 % 


1.798209 


18 | 


40 


SEQ ID NO: 7754 


0.00 % 1 


1.68996456 


19 1 


39 


SEQ ID NO: 7755 


0.00 % 


1.516482 


20 1 


118 


SEQ ID NO: 7756 


0.00 % 


1.2683304 : 



HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


91 


I SEQ ID NO: 7757 


2.77 % 


1 



HLA A 1101 - 10 mers | 


Maximum possible score using this molecule type 


36 I 


Rank 


Start position 


Sequence j[ 


% of max. score 


Score | 


1 


101 


SEQ ID NO: 7758 \ 


33.33 % 


12 I 


2 


71 


SEQ ID NO: 7759 \ 


2.77 % 


1 I 


3 


90 


SEQ ID NO: 7760 H 


1.66 % 


0.6 1 




HLA B7 - 9 mers 


Maximum possible score using this molecule type 


1 5400 


Rank | 


1 Start position | 


Sequence 


% of max. score 


| Score 


i l 


1 49 | 


SEQ ID NO: 7761 || 


2.22 % 


I 120 


2 1 


1 9 | 


SEQ ID NO: 7762 | 


1.11 % 


1 60 


3 1 


1 73 1 


SEQ ID NO: 7763 | 


0.66 % 


I 36 


4 


I 33 | 


SEQ ID NO: 7764 | 


0.37 % 


1 20 


5 


I 137 


1 SEO ID NO: 7765 II 


0.37 % 


1 20 


6 I 141 


SEQ ID NO: 7766 |j 


0.37 % 


I 20 


7 1 77 


I SEQ ID NO: 7767 |j 


0.22 % 


1 12 
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8 1 


112 


SEQ ID NO: 7768 1 


0.22 % 


12 J 


9 ! 


143 


SEQ ID NO: 7769 


0.22 % 




10 1 


81 


SEQ ID NO: 7770 1 


0.14 % 


8 "1 


11 1 

XX { 


13 I 


SEQ ID NO: 7771 [ 


0.09 % 


5 I 


12 1 

X^. | 


69 I 


SEQ ID NO: 7772 | 


0.09 % 


5 I 


13 


18 1 


SEQ ID NO: 7773 I 


0.07 % 


4 I 


14 i 


32 | 


SEQ ID NO: 7774 j 


0.07 % 


4 J 


15 I 


54 | 


SEQ ID NO: 7775 | 


0.07 % 


4 i 


16 1 

xO | 


80 1 SEQ ID NO: 7776 [ 


0 07 % 


4 ! 


17 1 


92 I SEQ ID NO: 7777 | 


0.07% 


4 I 


18 | 


108 


SEQ ID NO: 7778 I 


0.07 % 


4 


19 1 


111 


SEQ ID NO: 7779 1 


1 0.07 % 


4 


20 


123 


SEQ ID NO: 7780 I 


0.07 % 


4 




HLA B7 - 10 mers 


Maximum possible score using this molecule type 


5400 


rvdiiiv 


Start position I 


Sequence | 


p/ n f\f may crnrp 


Score 


i 


99 1 


! SEQ ID NO: 7781 | 


0.74 % 


40 


2 


17 I 


1 SEQ ID NO: 7782 I 


j 0:37 % 


20 


3 


8 1 


I SEQIDNO:7783 


| 0.22 % 


. 12 


4 


72 1 


1 SEQ ID NO: 7784 I 


1 0.22 % 


12 


5 


91 1 


I SEQ ID NO: 7785 1 


I 0.22 % 


. 12 


6 




\ SEQ ID NO: 7786 | 


I 0.11 % 


6 


7 


1 Z \ 


[ SEQ ID NO: 7787 


! 0.07 % 


4 ! 


8 


32 


I SEQ ID NO: 7788 I 


1 0.07 % 


' 4 


9 


53 1 SEQ ID NO: 7789 I 


1 0.07 % 


4 


10 


76. 


SEQ ID NO: 7790 | 


1 0.07 % 


4 


11 


107 


SEQ ID NO: 7791 


1 0.07 % 


4 


12 


110 


SEQ ID NO: 7792 


I 0.07 % 


4 


13 


111 


SEQ ID NO: 7793 


i 0.07 % 


4 ! 


14 


140 


SEQ ID NO: 7794 


1 0.07 % 


4 


15 


9 


SEQ ID NO: 7795 


I 0.05 % 


3 


16 


19 


SEQ ID NO: 7796 


0.05 % 


3 


17 


33 


SEQ ID NO: 7797 


0.03 % 


2 


1 18 


1 93 


SEQ ID NO: 7798 


0.03 % 


2 


1 19 


102 


SEQ ID NO: 7799 


0.03 % 


2 


I 20 


129 


SEQ ID NO: 7800 


j 0.02 % 


1.5 
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Table 15: Epitopes for SEQ ID NO: 6041 



HLA Ai - 9 mers 


Maximum possible score using this molecule type 


5625 


Rank II Start position 


Seauence 


% of max. score 


Score 


1 H 1818 


SEQ ID NO: 7801 


1.6 % 


1 90 


2 H 373 


SEQ ID NO: 7802 


1.33 % | 75 


— — ~ II — 

3 || 681 


SEQ ID NO: 7803 


\ 1.33% 1 75 


4 || 74 


SEQ ID NO: 7804 


0.88 % 1 50 


5 || 786 


SEQ ID NO: 7805 


0.88 % || 50 


6 1495 


SEQ ID NO: 7806 


0.88 % || 50 


7 || 88 


SEQ ID NO: 7807 


0.8 % | 45 


~ 11 

8 || 357 


SEQ ID NO: 7808 


! 0.8% H 45 


9 1271 


SEQ ID NO: 7809 


0.8 % || 45 


10 1 1799 


SEQ ID NO: 7810 


0.8% |". 45 


11 II 1393 


SEQ ID NO: 7811 


0.48% 8 27 


12 U 386 


SEQ ID NO: 7812 


0.44% || . 25 


13 B . 2304 


SEQ ID NO: 7813 


0.44 % | 25 


14 || 198 


SEQ ID NO: 7814 


0.4 % 1 


! 22.5 


15 II 840 


SEQ ID NO: 7815 


i 0.4 % || . 22.5 


16 U 2359 


SEQ ID NO: 7816 


! 0.4% . I 


I 22.5 


17 


1 1194 


SEQ ID NO: 7817 


0.32 % 


1 18 


18 


1 1546 


SEQ ID NO: 7818 


0.32 % | 


18 


19 


2200 I 


SEQ ID NO: 7819 


0.22 % 


I 12.5 


20 


996 ! 


SEQ ID NO: 7820 


0.2 % 


11.25 




r HLA Al - 10 mere 


iMaximum possible score using this molecule type 


5625 




I Start position 


Sequence 


% of max. score 


Score 


1 


1 995 


SEQ ID NO: 7821 


10 % 


562.5 


2 


I 1303 


SEQ ID NO: 7822 


2.22 % 


125 


3 


I 1582 


SEQ ID NO: 7823 


2 % 


112.5 


4 


S 1456 


SEQ ID NO: 7824 


1.6 % 


90 


5 


1 772 


SEQ ID NO: 7825 


1.11 % 


62.5 


6 


[ 181 


SEQ ID NO: 7826 


0.88 % 


50 


7 


! 632 


SEQ ID NO: 7827 


0.88 % 


50 


8 


I 2281 


SEQ ID NO: 7828 


0.88 % 


50 


9 


1 1586 


SEQ ID NO: 7829 


0.8 % 


45 


10 


I 2109 


SEQ ID NO: 7830 


0.8 % 


45 


11 


i 745 


SEQ ID NO: 7831 


0.55 % 


31.25 
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1 12 


1 


1916 


II SEQ ID NO: 7832 II 


0.53 % 


1 30 


b 13 


| 


966 


I SEQ ID NO: 7833 fl 


0.44 % 


1 25 


14 


] 


1387 


I SEQ ID NO: 7834 II 


0.44 % 


25 


15 


1 


2263 


II SEQ ID NO: 7835 II 


0.44 % 


25 


16 


I 


2457 


H SEQ ID NO: 7836 II 


0.26 % 


15 


17 


I 


1057 


H SEQ ID NO: 7837 H 


0.22 % 


12.5 


18 


h 


2562 


II SEQ ID NO: 7838 II 


0.22 % 


12.5 


19 




74 


II SEQ ID NO: 7839 II 


0.17% 


10 


20 


1 


298 


II SEQ ID NO: 7840 II 


0.17 % 


10 




HLA A3 - 10 mers 


Maximum possible score using this molecule type 


12150 


Rank || Start position 


Sequence I 


% of max. score 


Score | 


1 II 392 


SEQ ID NO: 7861 1 


2.46 % 


300 | 


2 II 2230 II SEQ ID NO: 7862 1 1.48% 


180 
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1 3 I 


590 


SFO ID NO' 7863 


1 1 1 % 


ll 


4 1 


697 


SEO ID NO" 7864 


I 1 11 % 

1 X . X X /O 


135 


5 1 


Q19 

j x^ 


SEO ID NO' 7865 


n 74 % 




1 6 1 


13*54 
ijj ■ 


SEO ID NO' 7866 


n 74 % 


QO 


7 1 


i4^n 

ItJw 


SFO ID NO' 7867 

«^[^V^[ XLS IMUi / \J\J / 


H 74 OA 
u./'t /o 


I 


8 1 


7R34 


SFO ID NO' 7868 


n 74 oa 


1 QO 


9 I 




SFO ID NO* 78fiQ 


1 n 4Q OA 


fin 

OU 


1 n I 

JLU I 


r j.oo 

T'OO 


QFH TH MH- 7ft70 
DCy ILf l»V-/« /O/U 


n AQ OA 


AH 
DU 


1 1 I 




Cpn TH MH* 7fl71 
OC-Vd IvU. /Of x 


n 2iQ oa 






1 7^ 


QPH Tn MO- 7R77 


n aq oa ■* 
u.*+y vo 


OU 




77Q1 




n AO OA 

u.4y vo 


OU 


14 I 
14 1 


1 QQ4 


oty 1U IMU. /0/4 


n A/1 OA 

| U.44 vo . 


D4 


13 1 


7«^7 


cpn TH MO- 7Q7*C 


n >i>i oa 
U.*Kr vo . 


34 


16 | 


555 


SEO ID NO: 7876 


0.37 % : 


I. 45 


17 | 


1134 


SEQ ID NO: 7877 


0.37 % " 


• 45 


18 


1149 


SEQ ID NO: 7878 


! 0.29 % ■ 


' 36 


19 


283 | 


SEQ ID NO: 7879 


! 0.24 % ■ 


30 


20 


917 | 


SEQ ID NO: 7880 


0.24 % | 30 



HLA A24 - 9 mers 

Maximum possible score using this molecule type . II 1596.672 



Rank (| Start position 


Sequence 


% of max. score I 


Score 


i 


1 2375 


SEQ ID NO: 7881 


36.07% • 1 


576 


2 


1 1751 


SEQ ID NO: 7882 


28.93 % 


462 


3 


1 195 


SEQ ID NO: 7883 II 25.05 % . 1 


400 


4 


| 2306 | 


SEQ ID NO: 7884 H 21.04 % | 


336 


5 


i 806 i 


SEQ ID NO: 7885 | 20.66 % 


330 


6 


1252 ! 


SEQ ID NO: 7886 | 18.78 % 


300 


7 


160 


SEQ ID NO: 7887 j 15.03 % 


240 ! 


8 


517 


SEQ ID NO: 7888 j 15.03 % 


240 I 


9 


375 1 SEQ ID NO: 7889 II 12.52 % i 


200 I 


10 


1275 I SEQ ID NO: 7890 1 12.52 % 1 


200 


11 


2175 I SEQ ID NO: 7891 II 12.52 % ' 1 


200 


12 


2207 1 SEQ ID NO: 7892 | 12.52 % 


200 | 


1 13 


2343 | SEQ ID NO: 7893 H 12.52 % ! 


200 | 


1 14 


443 ! 


SEQ ID NO: 7894 | 11.27 % i 


180 | 


1 15 


668 i 


SEQ ID NO: 7895 


| 7.51 % ! 


120 


1 16 


1825 ! 


SEQ ID NO: 7896 I 


1 6.88 % | 


110 


1 17 


1690 i 


SEQ ID NO: 7897 I 


1 4.69 % 


75 
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1 18 


1 159 


SEO ID NO: 7898 


1 3.75 % 


60 


1 19 


1 2550 


SEQ ID NO: 7899 


1 3.75 % 


60 


I 20 


I 1949 


SEQ ID NO: 7900 


l~~ 3.38% 


54 




HLA A24 - 10 mers 


Maximum possible score using this molecule type 


1596.672 




1 start position 


Sequence 


1 % of max. score 


Score 


1 


541 


SEQ ID NO: 7901 


I 45.09 % 


720 1 


2 


BUS 


SEQ ID NO: 7902 


L 24.80 % 


396 


3 


i2U9 


SEQ ID NO: 7903 


i 22.54 % 


360 


A 

L ^ 




SEQ ID NO: 7904 


1 18.03 % 


1 288 1 


5 




SEQ ID NO: 7905 


\ 15.03 % [ 


1 240 


o 


coo 


SEQ ID NO: 7906 


1 15.03 % \ 


! 240 


i / 


/yy 


SEQ ID NO: 7907 


15.03 % 


240 


o 
O 




SEQ ID NO: 7908 


15.03 % I 


1 240 


9 


22iy 


SEQ ID NO: 7909 


15.03 % 1 


240 


10 


1065 


SEQ ID NO: 7910 


13.77 % 


220 


11 


1953 


SEQ ID NO: 7911 


! 13.15 % 


210 


12 


1966 


SEQ ID NO: 7912 I 


12.52 % . | 


200 


13 


2600 


SEQ ID NO: 7913 I 12.52 % I 


200 


14 


71 


SEQ ID NO: 7914 I 


1 9.39 % 


150 


15 


380 


SEQ ID NO: 7915 1 


i 9.39% | 


150 


16 


1989 


SEQ ID NO: 7916 1 


i 9.39 % 


150 


17 


342 


SEQ ID NO: 7917 i 


1 8.76 % 


140 


18 


1071 


SEQ ID NO: 7918 1 


8.76 % 


140 


19 


2570 


SEQ ID NO: 7919 1 


1 6.88 % 1 


no S 


20 


2550 


SEQ ID NO: 7920 I 


6.26 % | 


100 ij 



HLA A 0201 - 9 mers 



Maximum possible score using this molecu 


etype 


3925227.1 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


! 1632 


SEQ ID NO: 7921 


0.09 % 


3607.31448 


2 


1640 


SEQ ID NO: 7922 


0.04 % 


1748.2560912 


3 


1776 


SEQ ID NO: 7923 


0.03 % 


1492.58592 


4 


2512 


SEQ ID NO: 7924 


0.03 % 


1434.16845 


5 


1073 


SEQ ID NO: 7925 


0.03 % 


1338.876 


6 


230 


SEQ ID NO: 7926 


0.01 % 


685.78272 


7 


1001 


SEQ ID NO: 7927 


0.01 % 11 559.8936 


8 


716 


SEQ ID NO: 7928 


0.01 % 


! 558.27486 
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1 9 


2280 


SEQ ID NO 


7929 


1 0.01 % 


JL 511. 19781048 1 


10 


590 


SEQ ID NO 


7930 


0.01 % 


|| 469.6692 


r 11 


664 


SEQ ID NO 


: 7931 


0.01 % 


J|_442 . 076389524J 


12 


1094 


SEQ ID NO 


: 7932 


0.00 % 


HT 382.536 1 


13 


1735 


SEQ ID NO 


: 7933 


0.00 % 


382.536 1 


14 


1625 


SEQ ID NO 


: 7934 


0.00 % 


II 342.4606344 1 


15 


1974 


SEQ ID NO 


: 7935 


0.00 % 


H 336.885048 1 


16 


2382 


SEQ ID NO 


: 7936 


0.00 % 


J 319.9392 ! 


17 


2417 


SEQ ID NO 


• 7937 


0.00 % 


1 319.9392 ! 


18 


744 


SEQ ID NO: 


7938 


0.00 % 


II 256.416670125 1 


19 


108 


SEQ ID NO 


7939 


0.00 % 


1 232.52724 1 


20 


390 


SEQ ID NO 


■ 7940 


0.00 % 


|| 228.0411084 | 



HLA A 0201 - 10 mere | 


Maximum possible score using this molecule type 


3925227.1 | 


Rank 


Start position 


Sequence 


°/o of max. score 


Score 


1 


2511 


<?EO ID NO* 7Q41 


Ui JU /O j 


15126.90795 [ 


2 


1608 


SEQ ID NO: 7942 


0.05 % 


i 2049 4(556 "I 


3 


2572 


SEQ ID NO: 7943 


0.04 % 1 


1879.5921264 I 


4 


255 


SEQ ID NO: 7944 


0.03 % | 


1566.6522795 1 


5 


895 


SEQ ID NO: 7945 


0.03 % | 


1338.876 


6 


1171 


SEQ ID NO: 7946 


0.02 % i 


1107.960876 


7 


1691 


SEQ ID NO: 7947 


0.01 % 


782.95521024 1 


8 


20 


SEQ ID NO: 7948 


0.01 % | 


549.9372312 1 


9 


1632 i 


SEQ ID NO: 7949 


0.01 % ! 


479.041993296 I 


10 


2280 


SEQ ID NO: 7950 


0.01 % i 


472.418344576987| 


11 


1963 


SEQ ID NO: 7951 


o.oo % ! 


358.73928 


12 


1955 1 


SEQ ID NO: 7952 


o.oo % ! 


331.093464 


13 


741 


SEQ ID NO: 7953 


0.00 % I 


318.652488 I 


14 


523 


SEQ ID NO: 7954 


0.00 % 


278.7876 | 


15 


1073 


SEQ ID NO: 7955 


0.00 % I 


266.6988828 i 


16 


2489 


SEQ ID NO: 7956 


0.00 % 1 


243.432 | 


17 


777 


SEQ ID NO: 7957 


0.00 % 


218.5730664 i 


18 


1737 


SEQ ID NO: 7958 


0.00 % 1 


218.0785572 


19 


589 


SEQ ID NO: 7959 


0.00 % 1 


210.538251 


20 


229 | 


SEQ ID NO: 7960 


0.00 % i 


205.230564 



HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 


36 
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1 Rank B Start position 


Sequence 


°/o of may score 


1 Score 


1 
1 


2337 


SEQ ID NO: 7961 


33 33 % 


12 


-> I 
2 


2156 1 SEO ID NO: 7962 


9^ % 


q 


3 






1 £m\J /O 


7 2 


4 


18 


SEQ ID NO: 7964 


16 66 % 

IUiUU /u 


6 


i> \ 


TOT 

332 


SEQ ID NO: 7965 


1 fi fifi % 


o 


6 I 


415 


SEQ ID NO: 7966 


1 fi 66 % i 


fi 


7 \ 


2479 


SEQ ID NO: 7967 


1 fi fifi o/ n I 

10.00 /O | 


fi 
o 


8 i 


1495 


SEQ ID NO: 7968 


1 1 1 1 % I 
11.11 /O | 


A 
r 




2035 


SEQ ID NO: 7969 


11 11 OL I 
11.11 /O 


A 
*+ 


10 


1349 


1 SEQ ID NO: 7970 


in o/ ft 

lU /O j 


^ fi 


11 


1194 


SEQ ID NO: 7971 


O.JJ /o 




12 


1648 


SEQ ID NO: 7972 


Q OA. 
O.JJ /o 


•a 


1 J 


96 


SEQ ID NO: 7973 


fi fifi o/. . 
o.oo /o 


9 A 


14 


764 


SEQ ID NO: 7974 


fi fifi % 

0«00 /o 




15 


986 


SEQ ID NO: 7975 


fi fifi % 

O.UO /O 


9 4 


16 


2345 


SEQ ID NO: 7976 


fi fifi O/i 
O.OO /O 


9 4 


17 


698 


SEQ ID NO: 7977 


5.55 % 


2 


18 


1355 


SEQ ID NO: 7978 


5.55 % 


2 


19 


1987 


SEQ ID NO: 7979 


5.55 % 


2 


20 


2085 


SEQ ID NO: 7980 1 


1 5.55% 


2 




^ HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start position 


1 Sequence 


% of max. score 


Score 


! l 


2083 


I SEQ ID NO: 7981 


33.33 % 


12 


2 


2123 


I SEQ ID NO: 7982 


25 % 


9 


3 


2147 


SEQ ID NO: 7983 


16.66 % 


6 


4 


331 


SEQ ID NO: 7984 


12.5 % 


4.5 


5 


1035 


SEQ ID NO: 7985 


11.11 % 


4 


6 


1064 


SEQ ID NO: 7986 


11.11 % 


4 


7 


2154 


SEQ ID NO: 7987 


11.11 % 


4 


8 


1048 


SEQ ID NO: 7988 


7.5 % 


2.7 


9 


202 


SEQ ID NO: 7989 


6.66 % 


2.4 


1 io 


721 


SEQ ID NO: 7990 


6.66 % 


1 2.4 


1 11 


2109 


SEQ ID NO: 7991 


6.66 % 


1 2.4 


1 12 


2230 


SEQ ID NO: 7992 


6.66 % 


I 2.4 


13 


1306 


SEQ ID NO: 7993 


5.55 % 


1 2 


14 


1622 


SEO ID NO: 7994 


5.55 % 


1 2 
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1 R 


1772 H SEO ID NO: 7995 


5.55 % | 2 


1fi 


1796 


SEQ ID NO: 7996 


5.55 % | 2 


17 


186 


SEQ ID NO: 7997 


5 % 


1.8 


18 


414 


SEQ ID NO: 7998 


5 % 


1.8 


19 


697 


SEQ ID NO: 7999 


5 % 


I 1.8 


20 


1175 


SEQ ID NO: 8000 


5% 


! 1.8 




HLA B7 - 9 mers 


Maximum possible score using this molecule type 


5400 




Start position 


Sequence II % of max. score 


Score 


l 1 


1447 


SEQ ID NO: 8001 II 14.81 % 


800 


2 


642 


SEQ ID NO: 8002 I 3.70 % 


200 




34 


SEQ ID NO: 8003 fl 2.22 % 


120 


1 

4 I 


186 


SEQ ID NO: 8004 H 1.48 % 


/ 80 


5 I 


244 | 


SEQ ID NO: 8005 R 1.48 % 


80 


6 I 


459 


SEQ ID NO: 8006 II 1.48 % 


80 


7 | 


1475 1 


SEQ ID NO: 8007 | 


! 1.48 % 


80 


8 1 


1867 | 


SEQ ID NO: 8008 | 


! 1.48% 


80 


9 I 


2032 


SEQ ID NO: 8009 I 


I 1.48 % 


80 


10 I 


2047 || SEQ ID NO: 8010 | 


! 1.48 % 


80 


11 ■ 1 


2335 II SEQ ID NO: 8011 | 


1.48 % 


80 


12 | 


622 || SEQ ID NO: 8012 I 


1.11 % 


60 


13 I 


1375 I SEQ ID NO: 8013 | 


1.11 % 


60 


14 


1617 1 SEQ ID NO: 8014 | 


0.92 % 


50 


15 


1023 1 SEQ ID NO: 8015 | 


0.83 % 


45 


16 


286 I SEQ ID NO: 8016 | 


0.74 % 


: 40 


17 


490 II SEQ ID NO: 8017 | 


0.74 % 


40 


18 


810 II SEQ ID NO: 8018 ! 


0.74 % 


40 


19 


1420 II SEQ ID NO: 8019 I 


0.74 % 


40 


20 


1854 || SEQ ID NO: 8020 | 


0.74 % 


40 




1 HLA B7 - 10 mers 


Maximum possible score using this molecule type 


5400 


1 Rank 


Start position 


1 Sequence 


I % of max. score 


Score 


1 1 


1617 


I SEQ ID NO: 8021 


\ 3.70 % 


200 


1 2 


752 


1 SEQ ID NO: 8022 


2.22 % 


120 


1 3 


1552 


I SEQ ID NO: 8023 


2.22 % 


120 


1 4 


154 


1 SEQ ID NO: 8024 


1.48 % 


80 


I 5 165 


I SEQ ID NO: 8025 


I 1.48 % 


80 
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|i 6 


383 


1 SEO ID NO: 8026 


r~ i 48 % 


1 80 


7 


1501 


1 SEO ID NO: 8027 


! 1 48 % 


1 80 


J 8 


2093 


I SEO ID NO: 8028 


I 1 48 % 


1 80 


1 9 


2564 


1 SEO ID NO: 8029 


1 1 48 % 


1 80 


1 10 


622 


1 SEO ID NO" 8030 


i 1 1 1 % 

| 1.11 /o 


1 60 


11 


1086 
•l \j \j \j 


SEO ID NO" 8031 


1 11 o/ Q 

1 X • X x / o 


1 60 1 
1 ou | 


12 


1262 


SEO ID NO" 8032 

JLy XL/ lilvi \J\J *J 


1 1 1 1 % 

j X . 1 1 /O 


1 60 1 


13 

1 *J 


1556 


SFO ID NO" 8033 


I 1 1 1 % 

| 1.11 /o 


1 60 1 
I ou | 


14 


R4R 


^FH TH NO' 8034 


I 1 /O 


1 i>4 | 


1 q 

X +J 


2R6 


SFO TD NO* R03R 


I U./H- /0 


1 4U | 


16 

x. %/ 


490 


SFO TD NO - 8036 

iJUy XL/ 1 IV ■ UU JU 


1 0 74 % 


1 40 1 


17 


552 


SEQ ID NO: 8037 


1 0.74 % 


1 40 1 


18 


1858 


SEQ ID NO: 8038 


0.74 % 


1 40 I 


19 


2107 


SEQ ID NO: 8039 


| 0.74 % 


1 40 | 


20 


2582 


SEQ ID NO: 8040 


I 0.74 % 


1 40 1 


Table 16: Epitopes for SEQ ID NO: 6042 


HLA Al - 9 mers 


Maximum possible score using this molecule type 


5625 


Rank || Start position 


1 Sequence 


% of max. score 


Score 


1 II 846 


SEQ ID NO: 8041 


2.22 % 


125 


2 1 798 


SEQ ID NO: 8042 


1.6% . 


90 


3 


787 


SEQ ID NO: 8043 


0.88 % 


50 


4 I 


1178 


SEQ ID NO: 8044 


0.88 % 


50 ! 


5 I 


637 


SEQ ID NO: 8045 


0.8 % 


45 


6 I 


557 


SEQ ID NO: 8046 


0.44 % 


25 


7 I 


1020 


SEQ ID NO: 8047 


0.44 % 


25 


8 I 


282 


SEQ ID NO: 8048 


0.32 % 


18 


9 1 


1241 


SEQ ID NO: 8049 


0.24 % 


13.5 


10 || 466 


SEQ ID NO: 8050 


! 0.22 % 


12.5 


11 1 


727 


SEQ ID NO: 8051 


0.2% 


11.25 


12 | 


706 


SEQ ID NO: 8052 


0.17 % 


10 


13 I 


324 


SEQ ID NO: 8053 


0.16 % 


9 


14 I 


752 


SEQ ID NO: 8054 


0.16 % 


9 


15 I 


54 


SEQ ID NO: 8055 


0.13 % 


7.5 


16 ! 


554 


SEQ ID NO: 8056 


I 0.13 % 


7.5 


1 17 | 


590 


SEQ ID NO: 8057 


0.12 % 


6.75 
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18 


569 | 


SEQ ID NO: 8058 II 0.08 % 


5 


19 


613 


SEQ ID NO: 8059 \ 0.08 % 


5 


20 


90 


SEQ ID NO: 8060 1 0.08 % 


! 4.5 | 




[ HLA Al - 10 mere 


Maximum possible score using this molecule type 


5625 


Rank 


Start position ] 


Sequence 


% of max. score 


Score 


1 


1241 | 


[ SEQ ID NO: 8061 


4.8 % 


270 


2 


967 1 


SEQ ID NO: 8062 


0.8 % 


45 


3 


1010 ! 


SEQ ID NO: 8063 


0.48 % 


27 


4 


426 | 


SEQ ID NO: 8064 


0.44 % 


25 


5 


809 | 


SEQ ID NO: 8065 


0.44 % 


25 


6 


1178 | 


SEQ ID NO: 8066 


0.44 % 


I 25 


7 


— TO ! 

787 1 


SEQ ID NO: 8067 


0.22 % 


12.5 


8 


? 958 [ 


SEQ ID NO: 8068 


i 0.22 % 


12.5 


9 


727 | 


SEQ ID NO: 8069 


\ 0.2 % 


11.25 


10 


610 


SEQ ID NO: 8070 


0.17 % 


10 


11 


1 12 I 


I SEQ ID NO: 8071 


0.13 % 


7.5 


12 


1181 ! 


1 SEQ ID NO: 8072 


0.12 % 


6.75 


13 


373 


SEQ ID NO: 8073 


0.11 % 


6.25 


14 


1 602 


SEQ ID NO: 8074 


0.11 % 


6.25 


15 


I 20 


SEQ ID NO: 8075 


0.04 % 


2.5 


16 


32 


SEQ ID NO: 8076 


0.04 % 


2.5 


17 


53 


SEQ ID NO: 8077 


0.04 % 


2.5 


18 


400 


SEQ ID NO: 8078 


0.04 % 


2.5 


19 


557 


SEQ ID NO: 8079 


0.04 % 


2.5 


20 


667 


SEQ ID NO: 8080 H 0.04 % 


2.5 



I HLA A3 - 9 mere I 


Maximum possible score using this molecule type 


I 12150 | 


Rank 


Start position 


|_ Sequence 


! % of max. score 


I Score | 


1 


I 768 


1 SEQ ID NO: 8081 


0.82 % 


1 100 I 


2 


1 808 


I SEQ ID NO: 8082 


0.49 % 


1 60 | 


3 


85 


I SEQ ID NO: 8083 


0.24 % 


1 30 | 


4 


663 


1 SEQ ID NO: 8084 


0.24 % 


i 30 | 


5 


1245 


1 SEQ ID NO: 8085 


0.14 % 


1 18 | 


6 


288 


1 SEQ ID NO: 8086 


0.09 % 


1 12 I 


7 


50 


I SEQ ID NO: 8087 


0.08 % 


1 io I 


8 


320 


1 SEQ ID NO: 8088 


0.07 % 


1 9 | 
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g 


402 


SEQ ID NO: 


: 8089 


0.07 % 


9 


10 


! 798 


SEO ID NO: 


: 8090 


0.07 % 


g 


11 

JL JL 


902 


SEO ID NO 


: 8091 


0 06 % 


8.1 


12 


i 364 


SEO ID NO: 


: 8092 


0.05 % 


6;75 


13 


297 


SEO ID NO' 


■ 8093 

1 \J\J ■/ »S 


0 04 % 


! 6 


14 


992 

-7 -7 fc- 


SEO ID NO 


: 8094 


0 04 % 


g 




38 


SEO ID NO 1 


* 8095 


0 03 % 


4 5 


16 


249 


SEO ID NO' 


' 8096 


0 03 % 


4 5 


17 


706 


SEQ ID NO: 


: 8097 


0.03 % 


4.05 


18 


1204 


SEQ ID NO. 


: 8098 


0.03 % 


4.05 


19 


I 1178 


SEQ ID NO. 


: 8099 


0.03 % 


4 


20 


343 


SEQ ID NO. 


: 8100 


0.02 % 


3.6 




HLA A3 - 10 mere 


Maximum possible score using this molecule type 


lZXDU 


Rank 


start position 


Sequence 


% of max. score 


Score 


1 


255 


SEQ ID NO: 8101 


1.48 % 


lot) 


2 


180 


SEQ ID NO: 8102 


0.55 % 


67 5 


3 


768 


SEQ ID NO: 8103 


0.49 % 


60 


4 


1177 


SEQ ID NO: 8104 


0.49 % 


! 60 


5 


380 


SEQ ID NO: 8105 


0.24 % 


30 


6 


100 


SEQ ID NO: 8106 


0.18 % 


22.5 


7 


786 


SEQ ID NO: 8107 


0.16% 


20 


8 


1217 


SEQ ID NO: 8108 


0.16 % 


20 


9 


207 


SEQ ID NO: 8109 


0.14 % 


18 


10 


1183 


SEQ ID NO: 8110 


0.14 % 


18 


11 


38 


SEQ ID NO: 8111 


0.09 % 


12 


12 


52 


SEQ ID NO: 8112 


0.09 % 


12 


13 


8 


SEQ ID NO: 8113 


0.06 % 


8 


14 


679 


SEQ ID NO: 8114 


0.06 % 


8 


15 


73 


SEQ ID NO: 8115 I 0.05 % 


6.75 


16 


1204 


SEQ ID NO: 8116 II 0.05 % 


6.075 


17 


50 


SEQ ID NO: 8117 1 0.04 % 


6 


18 


774 


SEQ ID NO: 8118 || 0.04 % 


6 


19 


845 


SEO ID NO: 8119 


! 0.04 % 


6 


20 


214 


I SEQ ID NO: 8120 B 0.04 % 


5.4 



HLA A24 - 9 mere 

Maximum possible score using this molecule type f 1596.672 
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Rank 


1 Start position 


Sequence 


°/o of max. score 


Score 


1 1 


1 1118 


SEQ ID NO: 8121 


19.84 % 


316.8 


2 


1 51 


SEQJD NO: 8122 


18.78 % 


300 


3 1 


I 161 


SEQ ID NO: 8123 


18.78 % 


300 


4 ! 

T 1 


1 434 


SEQJP NO: 8124 


18.78 % 


1 300 


5 1 


| 365 


SEQ ID NO: 8125 


13 .77 % 


! 220 


6 1 


1 736 


SEQ ID NO: 8126 


12.52 % ' 


! 200 


7 1 


| 620 


SEO ID NO: 8127 1 


7 51 % ! 


1 120 


8 I 


1 1068 


SEO ID NO" 8128 I 


7 51 % 

/ • _/ X /I) 


1 170 

| JL ^ U 


9 I 


! 817 


SEO ID NO- 8129 i 


3 75 % 


1 fiO 


10 I 


I 336 


SEQ ID NO: 8130 1 


3.44 % 1 


1 55 


11 1 


1 687 


SEQ ID NO: 8131 1 


3.13 % 


1 50 


12 || 254 


SEQ ID NO: 8132 1 


1 2.34 % 


! 37 5 


13 || 627 


SEQ ID NO: 8133 


I 1.87 % 


1 30 


14 ! 


) 950 


SEQ ID NO: 8134 I 1.75 % 


28 


15. | 


i 28 


I SEQ ID NO: 8135 


1.56 % 1 


25 


16 | 


i 408 


SEQ ID NO: 8136 


1.56 % I 


25 


17 | 


i 159 


SEQ ID NO: 8137 


1.31 % 


21 


18 | 


1 1166 


SEQ ID NO: 8138 1 


1.26 % 


20.16 


19 | 


45 


SEQ ID NO: 8139 1 


1.25 % 


20 


20 


185 


SEQ ID NO: 8140 1 


1.25 % 


I 20 




HLA A24 - 10 mers 


Maximum possible score using this molecule type j 


I 1596.672 


Rank 


Start position 


Sequence 


% of max. score 


. Score 


1 


438 


SEQ ID NO: 8141 


27.55 % 


440 


2 


489 


SEQ ID NO: 8142 


22.54 % 


360 


3 


254 


SEQ ID NO: 8143 


18.78 % 


300 


4 


354 


SEQ ID NO: 8144 


11.27 % 


180 


5 


406 


SEQ ID NO: 8145 


11.27 % 


180 


6 


1047 


SEQ ID NO: 8146 


11.27 % 


180 


7 


473 


SEQ ID NO: 8147 


7.51 % 


120 


8 


350 


SEQ ID NO: 8148 


6.26 % 


100 


9 


769 


SEQ ID NO: 8149 


6.26 % 


100 


10 


193 


SEQ ID NO: 8150 


5.63 % 


90 


11 


479 


SEQ ID NO: 8151 


3.13 % 


50 


12 


0 


SEQ ID NO: 8152 


2.70 % 


43.2 


13 


813 


SEQ ID NO: 8153 


1 1.87 % 


30 


14 


739 


SEQ ID NO: 8154 


1.50 % 


24 
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1 15 


782 


SEO ID NO: 8155 


1.50 % 


II 24 


1 16 


1 1186 


SEO ID NO: 8156 


1.31 % 


1 21 


17 


I 910 


SEO ID NO: 8157 


1.05 % 


1 16.8 


18 


1 128 


SEO ID NO: 8158 


0.93 % 


1 15 


19 


1 183 


SEQ ID NO: 8159 


0.93 % 


i 15 


20 


i 1069 


SEQ ID NO: 8160 


0.93 % 


1 15 




HLA A 0201 - 9 mere J 


Maximum possible score using this molecule type 


3925227.1 | 


Rank! 


Start position || 


Sequence H % of max. score 


Score 


1 1 


1041 II SEQ ID NO: 8161 || 


0.01 % 


AO A 097fmO 1 

4o4.2379//o | 


2 I 


981 1 SEQ ID NO: 8162 || 


r\ r\f\ ft/ 

0.00 % 




3 1 


957 j SEQ ID NO: 8163 H 


0.00 % 


^vio a cr\cy a a '1 

34Z.460D344 


4 


896 II SEQ ID NO: 8164 || 


f\ r%f\ ft/ 

0.00 % 


232. 6931712 • 


5 1 


1173 II SEQ ID NO: 8165 || 


r\ r\f\ ft/ 

0.00 % 


Z01.4474JZ 


6 1 


733 II SEQ ID NO: 8166 H 


0.00 % 


■i dearie 

• 171.86790 


7 | 


410 II SEQ ID NO: 8167 || 


0.00 % 


! *ioir A.c^a 


8 1 


786 II SEQ ID NO: 8168 j 


0.00 % 


1 lift A'C3f\-t "i 

1 119.4oj01z 


9 1 


150 I SEQ ID NO: 8169 \ 


0.00 % 


1 102.17550222 


1 10 II 1 1 SEQ ID NO: 8170 H 


0.00 % 


1 94.98737754 


1 11 1 


\ 595 § SEQ ID NO: 8171 || 


0.00 % 


1 93.239424 


1 12 1 


I 1095 8 SEQ ID NO: 8172 || 


0.00 % 


89.41779 


1 13 


I 1166 II SEQ ID NO: 8173 H 


0.00 % 


87.58584 


I 14 | 845 J SEQ ID NO: 8174 1 


0.00 % 


79.642008 


1 15 1 


| 734 II SEQ ID NO: 8175 || 


0.00 % 


73.47672 


1 16 II 802 1 


SEO ID NO: 8176 || 


0.00% 


71.872056 


1 17. 


r 1213 nr 


SEO ID NO: 8177 H 


0.00 % 


I 71.872056 


1 18 


1 105 II SEQ ID NO: 8178 fi 


0.00 % 


50.232 


1 19 


1 939 II 


SEO ID NO: 8179 II 


0.00 % 


49.13352 


1 20 


1 130 || 


SEQ ID NO: 8180 || 


0.00 % 


48.732354 




1 




HLA A 0201 - 10 mers 


1 


IMaximum possible score using this molecule type | 


3925227.1 I 


|Rank 


I Start position || 


Sequence II % of max. score | 


Score | 


1 1 


I 372 II SEQ ID NO: 8181 II 


0.04 % 1 


1896.33528 I 


1 2 


1 410 II SEO ID NO: 8182 || 


0.02 % 


1134.008497441 


1 3 


! 162 II SEO ID NO: 8183 R 


0.01 % 1 


685.3897512 | 


1 4 


1 1076 II SEO ID NO: 8184 1| 


o.oi % S 


640.90320525 1 


5 


I 1196 II SEO ID NO: 8185 II 


0.01 % I 


623.742666372 1 
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\J 


353 


SEO ID NO: 8186 


0.01 % 


446.7384576 


7 




SEO ID NO: 8187 


0.00 % 


375.97824 


R 
o 


733 


SEO ID NO: 8188 


0.00 % 


271.863864 


Q 


r ~ 1 30 


SEO ID NO: 8189 


0.00 % 


235.6873848 | 


1 n 


41 5 


SEO ID NO: 8190 


0.00 % 


185.679 


1 1 


297 


SEO ID NO: 8191 


0.00 % 


177.496704 


-LZ 


i 
j. 


*?FO TD NO' 81 Q2 


0 00 % 


152 42160582 




JU 


9FO TD NO* 81Q3 


0 00 % 


110.013876 


1 A 


/ 


c:FO TD NO* 81 Q4 

ULL< XL/ lvv/« U13^ 


0 00 % 


101 0988 


JLD 




SFO ID NO* 8195 

J wv£ XL/ liLIf UlyJ 


0 00 % 1 


98.26704 


16 


261 


SEO ID NO: 8196 


0.00 % 


91.60164 


1 17 


1040 


SEQ ID NO: 8197 


0.00 % 


76.98537 


1 18 


928 


SEO ID NO: 8198 


0.00 % 


71.2908 


1 19 


1188 


SEQ ID NO: 8199 


0.00 % 


69;81282 - 


1 20 


1094 


SEQ ID NO: 8200 


0.00 % 


52.5987 



HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 




H 

H. 


36 


Rank 


Start position 


Sequence 


% of max. score 


1 


Score 


1 


402 


SEQ ID NO: 8201 


25 % 




9 


2 I 


I 902 I 


SEQ ID NO: 8202 


22.5 % 




8.1 


3 | 


288 


1 SEQ ID NO: 8203 


11.11 % 


II 


4 


^ 4 | 


85 


SEO ID NO: 8204 


6.66 % 


I 


2.4 




! 706 


SEQ ID NO: 8205 


6.66 % 


I 


2.4 




1 456 


SEQ ID NO: 8206 


5.55 % 




2 




920 


SEQ ID NO: 8207 


5.55 % 




2 




535 


SEQ ID NO: 8208 


5% 




1.8 


9 I 


L 364 


SEQ ID NO: 8209 1 


3.33 % 




1.2 


10 I 


1 438 


SEQ ID NO: 8210 ! 


3.33 % 


1 


1.2 


11 1 


1 798 


SEQ ID NO: 8211 


3.33 % 




1.2 


12 


i 808 


SEQ ID NO: 8212 


3.33 % 


I 


1.2 


13 


1 937 


SEQ ID NO: 8213 


3.33% 




1.2 


14 


1 956 


SEQ ID NO: 8214 


3.33 % 




1.2 | 


15 


| 557 


SEQ ID NO: 8215 


2.77 % 


i 


1 1 


16 


1218 


SEQ ID NO: 8216 


2.77 % 


1 


1 | 


17 


784 


SEQ ID NO: 8217 


2.5 % 


1 


0.9 


18 


1 249 


SEQ ID NO: 8218 


2.22 % 


II 


0.8 


19 


! 768 


SEQ ID NO: 8219 


2.22 % 


1 


0.8 I 


20 


| 1178 


SEQ ID NO: 8220 


! 2.22 % 




0.8 I 
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HLA A 1101 - 10 mere 



Maximum possible score using this molecule type 


7A 


Rank 

IVQ 1 1 IV 


*5teirt nnciHon 




% of may score 


Score 


1 
1 


^8 


^FO TD NO* 8991 


1*J. JJ /o 


A Q 


7 


807 


JtV^ XL/ IMUt O^c^ 


19 R0/ n 


vl IT 

4.5 




1 nn 


ccn TH MO* 899"3 


11 11 o/, 


4 


A 
H 


^8n 


^FH Tn NO- 8994 I 


11 11 % 

11.11 /o 


4 




7A7 
/O/ 


QFH TH MO' 879^ I 


1 n oa 

1U /0 


3.6 


0 




ccn TH MO- R77A I 
jl^ XL/ IM V-/ • OZ.Z.D j 


O OA 


3 


7 


QA7 


QFH TH MO* R777 I 


O.OO /o 


\ 2.4 


o 
o 


Q1 Q 


ccn TH WO- R99H I 

OCV^ XL/ OZ.Z.O | 


c: crq oa 

3. DO /0 j 


2 


Q 


^ jUj 


QFH TH NO' R99Q I 


"s OA 


1.8 


1 n 

1U 




cpn Tn Nn- 87^n I 

OClJ_ XL/ WKJm O^OU f 


"2 'I'J OA 


1.2 


1 1 


M 1 


QFO Tn Nfi- 89*31 I 


7 77 OA 
J.JJ /o 


1.2 


1 7 
1Z 


1 1 77 

1 JL / / 


QFn Tn Nn- 8777 I 

Dl^ XL/ l\C/« OZjZ I 


7 77 OA - 
J.JJ /o 


1.2 j 






QFn Tn Nn- R7^^ I 


7 77 OA : 


* 

\ 1 


14 


758 


SEQ ID NO: 8234 ] 


2.77 % 


i 


15 


797 


| SEQ ID NO: 8235 | 


2.5% : 


0.9 


16 


255 


SEQ ID NO: 8236 


2.22 % 


0.8 


17 


986 


SEQ ID NO: 8237 


2.22 % . 


1 0.8 


18 


1157 


SEQ ID NO: 8238 


2.22 % : 


. 0.8 


19 


170 


SEQ ID NO: 8239 


1.66 % ? 


• 0.6 


20 


893 


SEQ ID NO: 8240 


1.66 % 


0.6 



HLA B7 - 9 mers 


Maximum possible score using this molecule type 


5400 


Rank | 


Start position 


Sequence 


% of max. score 


Score 


1 I 


200 


SEQ ID NO: 8241 


1.48 % 


1 80 


2 | 


1243 


SEQ ID NO: 8242 


1.48 % i 


80 


3 I 


123 


SEQ ID NO: 8243 


0.74 % i 


40 


4 I 


248 


SEQ ID NO: 8244 


0.66 % 5 


36 


5 I 


1036 


SEQ ID NO: 8245 


0.66 % 


36 


6 I 


494 


SEQ ID NO: 8246 


0.37 % i 


20 


7 | 


495 


SEQ ID NO: 8247 


0.37 % ! 


20 


! 8 i 


523 


SEQ ID NO: 8248 


0.37 % I 


20 


9 ! 


842 


SEQ ID NO: 8249 


0.37 % I 


20 


10 


932 


SEQ ID NO: 8250 


0.37 % ! 


20 


11 1 


274 


SEQ ID NO: 8251 


0.33% II 18 | 
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12 1 


588 


SEQ ID NO: 8252 || 


0.22 % 


12 


13 1 


656 


SEQ ID NO: 8253 1 


0.22 % 


12 


14 1 


657 


SEQ ID NO: 8254 || 


0.22 % 


12 


15 1 


767 


SEQ ID NO: 8255 H 


0.22 % 


12 


16 1 


911 


SEQ ID NO: 8256 0 


0.22 % 


12 


17 1 


939 


SEQ ID NO: 8257 II 


0.22 % 


12 


18 | 


1007 


SEO ID NO: 8258 || 


0.22 % 


12 


19 | 


1170 


SEQ ID NO: 8259 || 


0.22 % 


12 


2Q_| 


1206 


SEQ ID NO: 8260 B 


0.22 % 


12 








HLA B7 - 10 mere 




1 


[Maximum possible score using this molecule type 


C/l ftft 1 

D4UU 1 




Start position I 


Sequence U_ 


°/o of max. score 


Score 1 


i 1 


1 505 


SEQ ID NO: 8261 || 


A A A O/ 

4.44 vo 


~>A ft 1 


2 1 


i 312 1 


i SEQ ID NO: 8262 II 


"3 "7ft 0/ 

S./\) vo 


Tftft 1 


..... . | 

3 I 


1 141 ! 


! SEQ ID NO: 8263 I 


1.11 VO 


Cft 


4 | 


1 1006 i 


I SEQ ID NO: 8264 8 


ft CC OA. 


oe 1 

JO | 


' 5 | 


i 411 ! 


I SEQ ID NO: 8265 fl 


0.44 % 


24 | 


6 I 


1 122 f 


I SEQ ID NO: 8266 B* 


0.37 % I 


1 20 | 




1 134 I 


I SEQ ID NO: 8267 || 


0.37 % 


20 | 


8 I 


1 184 | 


I SEQ ID NO: 8268 I 


0.37 % 


20 | 


9 I 


1 367 1 


S SEQ ID NO: 8269 || 


0.37 % 


20 J 


10 I 


| 402 : 


I SEQ ID NO: 8270 S 


0.37 % 


20 


11 


494 


I SEQ ID NO: 8271 | 


0.37% 


20 


12 ! 


1 560 


1 SEQ ID NO: 8272 II 


0.37 % 


20 




1 626 


1 SEQ ID NO: 8273 || 


0.37 % 


20 




1 931 


1 SEQ ID NO: 8274 II 


0.37 % 


20 


15 


1 956 


I SEQ ID NO: 8275 B 


0.37 % 


20 


16 


i 1117 


I SEQ ID NO: 8276 B 


0.37 % 


20 


17 


1169 H SEQ ID NO: 8277 | 


0.37 % 


20 


18 


1196 II SEQ ID NO: 8278 || 


0.37 % 


20 


19 


1 247 


SEQ ID NO: 8279 || 


0.22 % 


1 12 


20 


273 | SEQ ID NO: 8280 || 


0.22 % 


1 12 


Table 17: Epitopes for SEQ ID NO: 6043 



HLA Al - 9 mere 



Maximum possible score using this molecule type B 5625 
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1 Start position 


Sequence 


% of max. score 


Score 


1 


I 168 


SEQ ID NO: 8281 


0.2 % 


11.25 


2 


I 212 


SEQ ID NO: 8282 


0.08 % 


4.5 


3 


1 223 


SEQ ID NO: 8283 


0.08 % 


4.5 


1 4 

• 


f 104 


SEQ ID NO: 8284 


0.04 % 


2.5 


5 
*j 


170 


SEQ ID NO: 8285 


0.04 % 


2.5 


6 


99 


SEQ ID NO: 8286 


0.04 % 


2.25 


7 


188 


SEQ ID NO: 8287 


0.02 % 


1.35 


o 


180 


SEQ ID NO: 8288 


0.02 % 


1.25 


q 


219 


SEQ ID NO: 8289 


0.02 % 


1.25 ! 


10 

XV/ 


18 


SEQ ID NO: 8290 


0.01 % 


1 1 


1 1 

X X 


226 


L SEQ ID NO: 8291 


0.01 % 


1 1 


X,£m 


98 


[ SEQ ID NO: 8292 


0.01 % 


0.625 


X»J 


151 


SEQ ID NO: 8293 


0.01 % 


0.625 


14 

X"T 


10 


SEQ ID NO: 8294 


0.01 % 


0.6 


15 


1 13 


SEQ ID NO: 8295 


0.00 % 


0.5 


16 


32 


SEQ ID NO: 8296 


0.00 % 


0.5 


17 


70 


SEQ ID NO: 8297 


0.00 % 


0.5 


18 


78 I SEQ ID NO: 8298 


0.00% 


0.5 


19 


1 82 


SEQ ID NO: 8299 


0.00 % 


0.5 


20 


1 145 


SEQ ID NO: 8300 


0.00 % 


0.5 




HLA Al - 10 mers 


Maximum possible score using this molecule type 




5625 




I Start position 


Sequence 


% of max. score 


Score 


1 II 99 


SEQ ID NO: 8301 


r - o.8 % 


45 


2 223 


SEQ ID NO: 8302 


1 0.8 % 


45 


3 || 188 


SEQ ID NO: 8303 


0.48 % 


27 


4 II 206 


SEQ ID NO: 8304 


0.2 % 


11.25 


5 II 253 


SEQ ID NO: 8305 


0.17 % 


10 


6 II 174 


SEQ ID NO: 8306 


0.13 % 


7.5 


7 ft 97 


SEQ ID NO: 8307 


[ 0.04 % 


2.5 


8 II 257 


SEQ ID NO: 8308 


0.04 % 


2.5 


9 II 179 


SEQ ID NO: 8309 


0.04 % 


2.25 


10 II 162 


SEQ ID NO: 8310 


0.02 % 


1 1.25 


11 II 196 


SEQ ID NO: 8311 


0.02 % 


1.25 


12 || 219 


SEQ ID NO: 8312 ft 0.02 % 


1.25 


13 


1 18 


SEQ ID NO: 8313 


0.01 % 


1 


14 


1 246 


SEQ ID NO: 8314 


0.01 % ! 1 
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15 


38 


SEQ ID NO: 8315 


0.01% B 0.75 


16 


33 


SEQ ID NO: 8316 


0.00% B 0.5 


17 


69 


SEQ ID NO: 8317 


0.00 % 


0.5 


18 


81 


SEQ ID NO: 8318 


0.00 % 


0.5 


19 


104 


SEQ ID NO: 8319 


i 0.00 % 


0.5 


20 


116 


SEQ ID NO: 8320 


0.00 % 


0.5 




r HLA A3 - 9 mers 


Maximum possible score using this molecule type 


12150 


Rank H Start position 


Sequence 


% of max. score 


Score 


1 I 104 


SEQ ID NO: 8321 


0.98 % 


120 


2 | 123 


SEQ ID NO: 8322 


0.74 % 


90 


3 B 82 


SEQ ID NO: 8323 


f 0.44 % 


54 


4 i 106 


SEQ ID NO: 8324 


0.11 % 


HOC 

13.5 


5 | 99 


SEQ ID NO: 8325 


0.08 % 


10.8 


6 ! 


127 


SEQ ID NO: 8326 


0.08 % ' 


10 


7 I 


71 


SEQ ID NO: 8327 


0.07 % 


9 


hH 




SEQ ID NO: 8328 


0.06 % 


. ft 1 




113 


• SEQ ID NO: 8329 fl 0.04 % 


6 


10 I 


84 


SEQ ID NO: 8330 


0.03% 


4.5 


11 | 


109 


SEQ ID NO: 8331 


0.03 % 


4.05 


12 | 


58 


SEQ ID NO: 8332 


0.02 % 


3 


13 | 


138 


SEQ ID NO: 8333 


0.02 % 


3 


14 | 


44 


SEQ ID NO: 8334 


0.02% 


2.7 


15 | 


81 


SEQ ID NO: 8335 


0.02 % 


2.7 


16 | 


226 


SEQ ID NO: 8336 


0.02 % 


2.7 


17 | 


184 


SEQ ID NO: 8337 


0.01 % 


1.8 


18 | 


102 


I SEQ ID NO: 8338 


0.01 % 


1.215 


19 I 


39 


SEQ ID NO: 8339 


0.00 % 


1.2 


20 


234 


SEQ ID NO: 8340 


0.00 % 


0.9 



HLA A3 - 10 mers 


Maximum possible score using this molecule type 


12150 


Rank | 


Start position 


Sequence 


% of max. score 


Score 


i 1 


99 


SEQ ID NO: 8341 


1.33 % 


162 


2 1 


81 


SEQ ID NO: 8342 


I 0.44 % 


54 


3 1 


104 


SEQ ID NO: 8343 


! 0.24 % 


30 


4 1 


51 


SEQ ID NO: 8344 


0.16 % 


20 


5 I 


122 


SEQ ID NO: 8345 


0.11 % 


13.5 



■4T7- 



WO 2004/092360 



PCT/US2004/011710 



6 1 


1 71 

1 ' M 


SEQ ID NO: 8346 


! 0.07 % 


9 


7 | 


1 69 


SEQ_ID NO: 8347 


0.04 % 


I 6 


8 1 


! 223 


SEQ ID NO: 8348 


1 0.04 % 


5.4 


9 1 


1 84 


SEO ID NO: 8349 


1 0.03 % 


4.5 


10 1 


! 63 


SEO ID NO: 8350 


1 0.02 % 


3.6 


11 


I 138 


SEO ID NO: 8351 


1 0.02 % 


1 


1 3 


12 II 201 


SEO ID NO" 8352 


I 0.01 % | 


| 1.8 


13 


! 44 


SEO ID NO' 8353 


i 0.01 % ! 


| 1.35 


14 


1 83 


SEO ID NO' 8354 


1 0.01 % | 


! 1.35 


15 


! 116 


SEO ID NO* 8355 


i 0.00 % 


1 


1 1-2 " 


16 


I 46 


SFO TD NO* 8356 


0.00 % 


1 


1 0.9 


17 1 183 


SEQ ID NO: 8357 


| 0.00 % 


I 0.81 


18 


1 57 


SEQ ID NO: 8358 


1 0.00 % 


0.6 


19 


1 93 


SEQ ID NO: 8359 


0.00 % 


0.6 


20 


I 113 


SEQ ID NO: 8360 


0.00 % 


0.6 




HLA A24 - 9 mere 


Maximum possible score using this molecule type • I 


1596.672 


Rank 


Start position 


Sequence 


% of max. score | 


Score 


i 


198 


SEQ ID NO: 8361 


13.15 % | 




210 


2 1 


105 


SEQ ID NO: 8362 


9.39 % I 


i 


150 


3 1 


210 


SEQ ID NO: 8363 


4.69 % 1 


I' 


75 




1 75 


SEQ ID NO: 8364 


3.15 % 




50.4 




| 85 


SEQ ID NO: 8365 


2.63 % ! 


1 42 


6 1 205 


SEQ ID NO: 8366 


2.10 % 


1 33.6 


7 I 77 


SEQ ID NO: 8367 


1.87 % 


! 30 ! 




I 158 


SEQ ID NO: 8368 


0.65 % 


! 10.5 


-H 


i 103 


SEQ ID NO: 8369 


0.56 % | 


1 9 


io ! 


S 227 


SEQ ID NO: 8370 


0.55 % 


! 8.8704 


ii 1 


1 32 I 


SEQ ID NO: 8371 


0.54 % 


1 8.64 


12 


1 74 


SEQ ID NO: 8372 


0.50 % | 


1 8 




1 131 


SEQ ID NO: 8373 


0.50 % 




8 


14 


| 54 II SEO ID NO: 8374 


0.46 % 




7.5 


1 15 


1 99 I 


SEQ ID NO: 8375 


0.45 % 


1 


7.2 


1 16 


| 44 II SEO ID NO: 8376 


0.37 % 


6 


1 17 


1 62 I 


SEQ ID NO: 8377 


0.37 % 




6 


1 18 


1 87 I 


SEO ID NO: 8378 


0.37 % 


1 


6 


1 19 I 89 I 


SEO ID NO: 8379 i 


0.37 % 


1 


6 


1 20 


1 154 I 


SEO ID NO: 8380 H 


0.37 % 


1 


6 
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HLA A24 - 10 mere 


Maximum possible score using this molecule type 


1596.672 | 


1 Danlr II 
| KdllK || 


Old ll pOalLlwll 




°/a off may score 


Score | 


1 I! 
i II 


1 f\K 


cpo in MO 1 8*381 I 

OCy IU IMw. OJOl | 


77 ca o/ ft 


360 1 


j 1 

£ II 


7 OA 


CPH Tn NfV 8*389 i 


X/iJJ /o 


280 1 


1 

■3 1 


zuy 


OC^ AU IMU. OjOj i 


*3 1 *3 OA 


50 


A II 

4 1 


7C 
/D 


CPO TH MO* Q*3QA I 
OfZKl IU IMU. OJO^ | 


1 517 OA 
x.o/ vO 


30 1 


5 || 


QC 
OJ 


CPO TO MH' ft^R*^ I 
Jty 1U IMU. OjOj J 


1 87 OA 
X.O/ /o 


^0 I 


6 II 


-7-7 
77 


CPO TO MO« QQQfi I 
otv^ ±U IMU . ojOO | 


1 1 9 OA 


18 1 

— 


7 1 
7 1 


7A 

74 


CPO TO Mfl' P.*3Q7 I 
Oty IU IMU. 000/ J 


n 534 oa 

[ U.OH" /0 


1 1 44 


» \ 


0 1 n 
Z1U 


CPO TO MO- CTQQ I 
. oty ID IMU. OJOO j 


n o/_ 

| U.DO /O 


g 


9 1 


ZZo 


CPO TO MO- QQQQ I 

btvi IU IMU. oj$oy I 


I n oa 

U.JJ VO 


8 8704 






CPO TO MO- 8*300 I 

Ol^ IU IMU. OjjU I 


I n ^4 oa 

l U ■ J*T /O 


8 fi4 


11 I 


1 QQ 


CPO TO MO- 5**301 I 


I n 4fi oa 1 

I U.*tO /O I 


1 75 


1 o I 
1Z [ 


^7 

D/ 


cpn TO Mfl' 53*307 
OCl^ XU IMU. 00:7Z j 


I n 4 C » oa I 

t UitJ /O j 


1 72 


13 J 


152 


. SEQ ID NO: 8393 


I 0.43 % 


1 7 


'14 | 


43 


SEQ ID NO: 8394 j 


I 0.37% I 


i • 6 


• 15 1 


63 


SEQ ID NO: 8395 


I 0.37 % 


1 6 


16 


72 


SEQ ID NO: 8396 


! 0.37 % 


1 6 


17 


89 


i SEQ ID NO: 8397 


I 0.37 % 


HH 


18 


101 


SEQ ID NO: 8398 


0.37 % 




19 


107 


SEQ ID NO: 8399 


S 0.37 % 


1 6 


20 


! 111 


SEQ ID NO: 8400 I 0.37 % 


1 6 







HLA A 0201- 


9 mere 




Maximum possible score using this molecule type 


3925227.1 H 


Rank) 


Start position 


Sequence 


% of max. score 


Score | 


1 l 


138 


II SEQ ID NO: 8401 


0.21 % 


8532.082944 I 


2 1 


106 


j SEQ ID NO: 8402 


0.10 % 


3977.8497792 I 


3 


44 


II SEQ ID NO: 8403 


0.03 % 


1243.078056 | 


4 1 


71 


I SEQ ID NO: 8404 


0.00 % 


! 348.872832 | 


5 1 


I 234 


II SEQ ID NO: 8405 


0.00 % 


I 243.432 


6 1 


51 


|| SEQ ID NO: 8406 


0.00 % 


I 130.26096 1 


7 | 


I 109 


II SEQ ID NO: 8407 


0.00 % 


i 91.182672 


8 1 


1 81 


II SEQ ID NO: 8408 


0.00 % 


I 73.342584 


9 1 


! 88 


II SEQ ID NO: 8409 


0.00 % 


I 70.386624 


10 1 


1 1 


1 SEQ ID NO: 8410 


0.00 % 


I 65.32728732 


11 1 


1 38 


II SEQ ID NO: 8411 


0.00 % 


I 47.876409 
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I 1 7 


76 


SEO ID NO: 8412 


0.00 % 


36.8637882 


1 ^ 


46 


SEO ID NO- 8413 


0.00 % 


30 889782 




71 1 


SEO ID NO* 8414 


I 0 00 % 

V/iVv /U 


21 616753941 


1 ^ 
ID 


701 


SEO ID NO- 8415 

««/^V£ XL/ llwi W • A*/ 


0 00 % 


! 19 657134 


I 1 




SEO ID NO- 8416 


0 00 % 


18 4318941 


I 1 7 


199 


SEQ ID NO: 8417 


0.00 % 


16.496865 


I 18 


74 


SEQ ID NO: 8418 


0.00 % 


15.783256167 


1 19 


62 


SEQ ID NO: 8419 


0.00 % 


13.9968225 


1 20 


99 


SEQ ID NO: 8420 


0.00 % 


10.31851392 






r HLA A 0201 - 10 mers 


iMaximum possible score using this molecule type 


3925227.1 




I Start position | 


Sequence 


% of max. score 


Score 


1 


78 I 


SEQ ID NO: 8421 


0.01 % 


556.494246 


2 


138 ! 


SEQ ID NO: 8422 


0.01 % 


395.245972224 


"3 
3 


84 | 


SEQ ID NO: 8423 


0.00 % 


201.554244 


4 


71 1 


SEQ ID NO: 8424 


U.UU to 


143.65707264 


5 


44 


SEQ ID NO: 8425 


0.00 % 




6 


76 | 


SEQ ID NO: 8426 


0.00 % 


84.78671286 


7 




SEQ ID NO: 8427 


0.00 % 


69.552 


8 


211 


SEQ ID NO: 8428 


0.00 % 


52.7237901 


9 


113 


SEQ ID NO: 8429 


0.00 % 


47.99088 


10 


61 


SEQ ID NO: 8430 


0.00 % 


37.4509575 


11 


93 


SEQ ID NO: 8431 


0.00 % 


31.24872 


12 


137 | 


SEQ ID NO: 8432 


0.00 % 


31.1384304 


13 


37 I 


SEQ ID NO: 8433 


0.00 % 


27.531 


14 


55 


SEQ ID NO: 8434 


0.00 % 


22.9153278 


15 


98 1 SEQ ID NO: 8435 


0.00 % 


22.1063618985 


16 


108 i 


I SEQ ID NO: 8436 


0.00 % 


21.55457052 


17 


63 


I SEQ ID NO: 8437 


0.00 % 


21.3624 


18 


45 


I SEQ ID NO: 8438 


0.00 % 


19.657134 


19 


200 


I SEQ ID NO: 8439 


0.00 % 


19.657134 


20 


104 


ISEQ ID NO: 8440 


0.00 % 


13.87622016 



I HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 


36 


1 Rank | 


Start position 


Sequence 


% of max. score 


Score 


1 1 1 


58 


SEQ ID NO: 8441 


5.55 % 


2 


1 2 | 


125 


SEQ ID NO: 8442 


1.66 % 


0.6 
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3 


226 


SEO ID NO: 8443 


1.66 % 


0.6 


4 


229 


SEO ID NO: 8444 


1.66 % 


0.6 






HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


122 | 


SEO ID NO: 8445 


2.22 % 


0.8 


2 


228 


SEO ID NO: 8446 


2.22 % 


0.8 






HLA B7 - 9 mers I 


Maximum possible score using this molecule type 


5400J 


inma 


i Start position 


Sequence 


% of max. score 


Score I 


-M = 


SEQ ID NO: 8447 i 


0.66 % 


. 36 | 


2 1 86 


SEQ ID NO: 8448 I 


0.37 % 


. 20 | 


3 II 37 


SEQ ID NO: 8449 \ 


0.33 % 


- — 1 

18 | 


4 || 62 || SEQ ID NO: 8450 | 


0.33 % 


18 | 


5 1 32 


SEQ ID NO: 8451 | 


0.22 % • 


12 J 


6 || 102 


SEQ ID NO: 8452 1 


0.22 % : 




7 II 227 


SEQ ID NO: 8453 ! 


0.22 % 


:' 12. 1 


! 8 || 53 


SEQIDNO:8454 | 


0.11 % 


. 6 I 


9 II 1 


SEQ ID NO: 8455 1 


0.07 % 


4 | 


10 II 44 


SEQ ID NO: 8456 I 


0.07 % 


4 I 


11 II 56 1 


SEQ ID NO: 8457 | 


0.07% 


• 4 | 


12 || 64 ! 


SEQ ID NO: 8458 | 


0.07 % 


4 ! 


13 i! 74 | SEQ ID NO: 8459 | 


0.07 % 


' 4 i 


14 j 76 1 SEQ ID NO: 8460 ! 


0.07 % 


4 j 


15 || 87 II SEQ ID NO: 8461 i 


0.07 % 


4 


16 j 106 II SEQ ID NO: 8462 


0.07 % 


4 


17 | 131 ' 


SEQ ID NO: 8463 


0.07 % 


4 ! 


18 II 23 


SEQ ID NO: 8464 


0.03 % 


2 I 


19 1 157 


SEQ ID NO: 8465 


0.03 % 


2 


20 II 166 


SEQ ID NO: 8466 


0.03 % 


2 




1 HLA B7 - 10 mers 


iMaximum possible score using this molecule type 


5400 


I Rank 


Start position 


Sequence 


% of max. score 


Score 


1 1 


119 


SEQ ID NO: 8467 


3.33 % 


180 


I 2 


264 


SEQ ID NO: 8468 


1.48 % 


80 


1 3 


98 


SEQ ID NO: 8469 


0.66 % 


36 
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4 


27 


SEO ID NO: 8470 fl 


0.37 % | 20 


5 


86 


SEO ID NO: 8471 || 


fl "37 OA 
U.j/ vo 


zu 


6 


31 


SEO ID NO: 8472 II 


0.22 vo 


12 


7 


63 


SEO ID NO: 8473 H 


n oa 
0.22 vo 


12 


8 


96 


SEQ ID NO: 8474 || 


n o/ 
0.22 vo 


12 


9 


i 101 


SEO ID NO: 8475 II 


n oa 
0.22 vo 


12 


10 


226 


SEO ID NO: 8476 II 


0.22 % 


12 


11 


157 


SEO ID NO" 8477 U 


0.14 vo 


o 

8 


12 


176 


SEO ID NO: 8478 II 


f\ 4 A Of 

0-14 vo 


8 


13 


238 


SEO ID NO: 8479 II 


A i i ft/ 

0-14 % 


_ 8 


14 


36 


SEO ID NO: 8480 II 


0.11 VO 


o 


15 


53 


SEO ID NO: 8481 II 


0 11 % 
17 • X X /U 




16 


61 


SEQ ID NO: 8482 B 


0.11 % 


6 


17 


3 


SEQ ID NO: 8483 II 


0.07 % 


4 


18 


40 


SEQ ID NO: 8484 II 


0.07% 


4 


19 


55 


SEQ ID NO: 8485 II : 


0.07 % 


4 


1 20 


74 


SEQ ID NO: 8486 II 


0.07 % 


4 



Table 18: Epitopes for SEQ ID NO: 6044 



HLA Al - 9 mers 


Maximum possible score using this molecule type 


5625 


Rank 


I Start position 


Sequence 


°/o of max. score 


Score 


1 


69 


SEQ ID NO: 8487 


0.04 % 


2.5 


2 


89 


SEQ ID NO: 8488 


0.02 % 


1.5 


3 


141 


SEQ ID NO: 8489 


! 0.01% 


1 


4 


113 


SEQ ID NO: 8490 


0.00 % 


0.5 






HLA Al - 10 mers 


Maximum possible score using this molecule type | 5625 


Rank 


Start position 


Sequence 


% of max. score I 


Score 


1 


21 


SEQ ID NO: 8491 


0.02 % 


1.5 


2 


88 


SEQ ID NO: 8492 


0.02 % 


1.5 


3 


8 


SEQ ID NO: 8493 


0.02 % 


1.25 


4 


31 


SEQ ID NO: 8494 


0.00 % 1 


0.5 


5 


I 112 


SEQ ID NO: 8495 


L 0.00 % 


0.5 



5 



11 HLA A3 - 9 mers' 
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Maximum possible score using this molecule type 


1 ?1 


Rank 


Start position 


Sequence 


°/o of max. score 




1 


L. "0 


SEQ ID NO 


: 8496 


H TO ft/ 

1.23 % 


1 K(\ 


*> 

2 


77 


r»rn Trv 

SEQ ID NO 


, 8497 


4 H H ft/ 
1.11 % 


1 ^ 


3 


4 J 4 

141 


SEQ ID NO 


: 8498 


ft /ft ft/ 

0,49 % 




4 


95 


SEQ ID NO: 


: 8499 


0.32 % 




5 


128 


SEQ ID NO: 


: 8500 ! 


0.08 % 


1 n 

1U 


6 


113 


SEQ ID NO: 


: 8501 


0.04 % 


O 


7 


69 


SEQ ID NO 


: 8502 


0.01 % 


-> 


8 


22 


SEQ ID NO 


: 8503 


0.01 % 


1 Q 
l.O 


9 


42 


SEQ ID NO: 


, 8504 


0.01 % 


1 Q 
l.O 


10 


78 


SEQ ID NO: 


: 8505 


0.00 % 


l.Z 


11 


32 


SEQ ID NO: 


: 8506 


0.00 % 


1 


12 


54 


ccn Tr» Mfv 
ofcQ 1U NU, 




0.00 % 


n o 
i u.y 






SEQ ID NO: 


: 8508 


U.UU vo 


n o 
u.y 


14 


28 


SEQ ID NO: 


: 8509 


0.00 % 


0 6 


15 


I 36 


SEQ ID NO 


: 8510 


0.00 % 


0.6 


16 


48 


SEQ ID NO: 


: 8511 


0.00 % 


0.6 


17 


118 


SEQ ID NO: 


: 8512 


0.00 % 


0.6 


18 


4 


SEQ ID NO. 


8513 


0.00 % 


I 0.5 



*~ HLA A3 - 10 mers 


Maximum possible score using this molecule type 


12150 


Rank 


Start position | 


Sequence 


% of max. score 


Score 


1 


94 


SEQ ID NO: 


: 8514 


0.49 % 


60 


2 


48 


SEQ ID NO: 


: 8515 


0.16 % 


20 


3 


128 


SEQ ID NO 


: 8516 


0.16% 


20 


4 


60 


SEQ ID NO 


: 8517 


0.12 % 


15 


5 


! 127 


I SEQ ID NO 


: 8518 


0.12% 


15 


6 


1 25 


SEQ ID NO 


: 8519 


0.04 % 


1 6 


7 


1 95 


SEQ ID NO 


: 8520 


0.04 % 


6 


8 


141 


SEQ ID NO 


: 8521 


0.04 % 


6 


9 


41 


SEQ ID NO 


: 8522 


0.04 % 


5.4 


10 


77 


SEQ ID NO 


: 8523 


0.04 % 


5.4 


11 


116 


SEQ ID NO 


: 8524 


0.04 % 


5.4 


12 


91 


SEQ ID NO 


: 8525 


0.03 % B 4 


13 


4 


SEQ ID NO 


: 8526 


0.01 % 


2 


14 


112 


SEQ ID NO 


: 8527 


0.01 % 


1.8 


15 


1 113 


SEQ ID NO 


: 8528 


0.01 % 


I 1.35 



-433- 



WO 2004/092360 



PCT/US2004/011710 



16 H 12 


SEO ID NO* 8529 


0.00 % 


1 1.2 


17 « 31 


SEQ ID NO: 8530 


0.00 % 


1 


18 


! 32 


SEQ ID NO: 8531 


0.00 % 


1 


19 


1 15 


SEQ ID NO: 8532 


0.00 % 


I 0.9 


20 


1 27 


SEQ ID NO: 8533 


• 0.00% I 0.9 




HLA A24 - 9 mers 


Maximum possible score using this molecule ty 


pe 


1596.672 


Rank! Start position 


Sequence | 


% of max. score 


Score 


1 | 61 


SEQ ID NO: 8534 | 


14.46 % 


231 


2 I 


16 


SEQ ID NO: 8535 I 


3.13% 


50 


— r — i 

3 I 


120 


SEQ ID NO: 8536 I 


1.87 % 


30 


— z — 1 

4 I 


41 


SEQ ID NO: 8537 I 


0.60 % 


9.6 


5 


71 


SEQ ID NO: 8538 [ 


0.45 % 


7.2 


*> 1 


21 


SEQ ID NO: 8539 [ 


0.37 % 


6 


7 | 


53 


SEQJD NO: 8540 | 


0.37 % 


6 


8 


65 


SEQ ID NO: 8541 | 


0.37 % 


• 6 J 


9 


121 


SEQ ID NO: 8542 I 


U.3/ /O 


D | 


10 


74 


SEQ ID NO: 8543 I 


0.36 % ' 


i 5.76 


11 


20 


SEQ ID NO: 8544 | 


0.35 % 


5.6 


12 


79 


SEQ ID NO: 8545 1 


0.35 % 


5.6 


13 


105 


SEQ ID NO: 8546 | 


0.33 % ■'■ 


5.28 


14 


48 


SEQ ID NO: 8547 1 


0.30 % : 


4.8 


15 


88 


SEQ ID NO: 8548 


0.30 % 


4.8 


16 


106 


SEQ ID NO: 8549 I 


0.30 % 


4.8 


17 


37 


SEQ ID NO: 8550 1 


0.27 % 


4.4 


18 


70 


SEQ ID NO: 8551 1 


0.27 % 


4.4 


19 


18 1 


SEQ ID NO: 8552 


0.25 % 


4 1! 


20 


57 || SEQ ID NO: 8553 


0.22 % 


3.6 S 



HLA A24 - 10 mers 


Maximum possible score using this molecule ty 


pe 


1596.672 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


120 


SEQ ID NO: 8554 


1.87 % 


30 


2 


73 


SEQ ID NO: 8555 


0.54 % 


8.64 


3 


19 


SEQ ID NO: 8556 


0.52 % 


8.4 


4 


78 


SEQ ID NO: 8557 


0.52 % 


8.4 


! 5 


104 


SEQ ID NO: 8558 


0.49 % 


7.92 


6 


61 


! SEQ ID NO: 8559 


0.46 % 


7.5 
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7 


1 47 


II SEO ID NO" 8560 


0.45 % 


7.2 


Q 
O 


r 36 


II SEO ID NO: 8561 


0.41 % 


6.6 


Q 


! 52 


H SFO ID NO' 8562 1 


0 37 % 

V/ a %J i /O 


6 


1 0 


1 64 


H SEO ID NO- 8563 


0.30 % 


4.8 


1 1 

H 1 


1 70 
1 /u 


1 ^FO TD NO* R c 5fi4 


0 "30 % 


4 ft 




lUJ 


II QFH TH NO* ft^fi 1 ^ 


n ^o % 

U.JU /O 


4 ft 


1 J 


1 9*3 
IZj 


II QFO Tn MO* ft^fifi 
II Dl^ XL/ IMUi OjUU 


U.JU /o 


4 ft 


1 14 




Hi QFH Tn MO- ft^fi7 


0 97 Q/L 


4 4 


1 1^ 
I 15 




!| CFH Tn MO 1 ft^fift 
j| DC^ XL/ NU. OjUO 


U.«£3 /O 


*T 


1 16 


66 


II SEO ID NO: 8569 


0.25 % 


4 


17 


83 


1 SEQ ID NO: 8570 


1 0.25 % 


4 


18 


86 


II SEQ ID NO: 8571 


I 0.25 % 


4 


19 


101 


H SEQ ID NO: 8572 


I 0.25 % 


! : s 4 


20 


119 


1 SEQ ID NO: 8573 


I 0.25 % 


•4 







HLA A 0201 - 


9 mers 




Maximum possible score using this molecule type 




Rank 


Start position | 


[ Sequence 


% of max. score 


Score 


1 


62 I 


I SEQ ID NO: 8574 | 


0.00 % 


136.1646 


2 


85 ! 


I SEQ ID NO: 8575 | 


0.00 % 


69.6969 


3 


47 


I SEQ ID NO: 8576 | 


0.00 % 


60.153786 


4 


1 121 I 


I SEQ ID NO: 8577 | 


0.00 % i 


52.5182736 


5 


1 74 I 


I SEQ ID NO: 8578 | 


0.00 % 


49.13352 


6 


1 23 | 


1 SEQ ID NO: 8579 | 


0.00 % 


21.99582 


7 


1 78 


I SEQ ID NO: 8580 I 


i 0.00 % I 


19.42488 


8 


1 114 | 


I SEQ ID NO: 8581 I 


! o.oo % 


14.6900655 


9 D 4 1 SEQ ID NO: 8582 | 0.00 % 


11.304684 


10 | 79 


1 SEQ ID NO: 8583 I 


0.00 % 


8.4687081 


11 


1 122 


1 SEQ ID NO: 8584 | 


0.00 % 


6.0996 


12 


1 100 I 


I SEQ ID NO: 8585 | 


0.00 % 


5.382 


13 


I 105 II SEQ ID NO: 8586 


0.00 % 


4.981593 


14 


1 25 


I SEQ ID NO: 8587 


0.00 % 1 


4.968 


15 


I 115 II SEQ ID NO: 8588 I 


0.00 % 


4.966482 


16 


1 24 


I SEQ ID NO: 8589 j 


0.00 % 


4.4815221585 


17 


1 111 


I SEQ ID NO: 8590 


0.00 % 


4.128201 


18 


1 94 


I SEQ ID NO: 8591 


0.00 % 


3.67632 


19 


1 34 


I SEQ ID NO: 8592 


0.00 % 


3.47553 


20 


1 12 


I SEQ ID NO: 8593 


0.00 % 


3.30993 
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F HLA A 0201 - 10 mers 


iMaximum possible score using this molecule type 


0925227. 1 


*% — - i - 

Rank 


Start position 


Sequence 


°/o or max. score 


Score 


1 


77 


SEQ ID NO: 8594 


U.UO % 


147.97188 


2 


62 


SEQ ID NO: 8595 


0.00 % 


143.59176 I 


3 


113 


SEQ ID NO: 8596 


0.00 % 


106.83684 


4 


H~ 78 


SEQ ID NO: 8597 


0.00 % 


83.526984 


5 


86 


SEQ ID NO: 8598 


0.00 % 


83.526984 


6 


74 


SEQ ID NO: 8599 


0.00 % 


69.552 1 


7 


4 *^ 4 

121 


SEQ ID NO: 8600 I 


0.00 % 


61.06776 


8 


12 


SEQ ID NO: 8601 I 0.00 % 


50.232 


9 


44 


SEQ ID NO: 8602 | 


0.00 % 


26.082 


10 


4 


SEQ ID NO: 8603 I 


0.00 % 


18.3816 


11 


0 


SEQ ID NO: 8604 | 


0.00 % 


17.38386 


12 


i 72 


SEQ ID NO: 8605 ! 


0.00 % 


17.1396 


13 


22 


SEQ ID NO: 8606 


0.00 % 


16.21914 


14 


I 122 


SEQ ID NO: 8607 


0.00 % 


14.02908 


15 


64 


SEQ ID NO: 8608 


0.00 % 


11 1 fi1 


16 


i 46 


SEQ ID NO: 8609 


0.00 % 


10.34586 




54 


SEQ ID NO: 8610 | 0.00 % 


I 8.846145 


HH 


1 47 


SEQ ID NO: 8611 


0.00 % 


7.575080337 


19 | 131 


SEQ ID NO: 8612 


0.00 % 


7.452 | 


20 || 114 


SEQ ID NO: 8613 


0.00 % 


I 6.735366 



"~ HLA A 1101 -9 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


69 


SEQ ID NO: 8614 


5.55 % 


2 


2 


22 


SEQ ID NO: 8615 


5 % 


1.8 


3 


77 


SEQ ID NO: 8616 


5 % 


1.8 


4 


141 


SEQ ID NO: 8617 


I 3.33 % 


1.2 


5 


60 


SEQ ID NO: 8618 


2.22 % 


0.8 


! 6 


95 


SEQ ID NO: 8619 


2.22 % 


0.8 


. 7 


! 36 | SEQ ID NO: 8620 


1.66 % 


0.6 



\ HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


36 


Rank || Start position 


Sequence 


% of max. score 


Score 


1 II 41 


SEQ ID NO: 8621 


3.33 % 


1.2 
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68 


SEQ ID NO: 8622 


I 3.33 % 


1.2 


hi- 


94 


SEQ ID NO: 8623 


I 3.33 % 


1.2 


4 


31 


SEQ ID NO: 8624 U 2.77 % 


1 


5 


127 


SEQ ID NO: 8625 II 2.5 % 


0.9 




HLA B7 - 9 mere I 


Maximum possible score using this molecule type 


5400 | 


Rank II Start position 


Sequence 


I % of max. score 


Score | 


1 II 48 


SEQ ID NO: 8626 


f 0.74 % 


40 


2 | 


] 20 


SEQ ID NO: 8627 


I 0.37 % 


i 20 


3 H 121 


SEQ ID NO: 8628 


I 0.33 % 


18 


4 1 18 


SEQ ID NO: 8629 


r o.o7 % . 


4 


5 B 21 


SEQ ID NO: 8630 


I 0.07 % \ 


; 4 


6 H 37 


SEQ ID NO: 8631 


r o.o7 % 


4 


7 II 41 


SEQ ID NO: 8632 


0.07 % 


4 


8 1 .53 


SEQ ID NO: 8633 


0.07 % 


4 


9 II 65 


SEQ ID NO: 8634 


0.07 % f 


4 


10 || 70 


SEQ ID NO: 8635 


0.07 % 


4 


11 II 71 


SEQ ID NO: 8636 


' 0.07 % : 


4 


12 H 74 


SEQ ID NO: 8637 


0.07 % 


4 


13 || 79 fl SEQ ID NO: 8638 


0.07 % 


4 


14 || 88 


I SEQ ID NO: 8639 


0.07 % 


4 


15 | 105 


I SEQ ID NO: 8640 


0.07 % 


4 1 


16 | 


! 106 


1 SEQ ID NO: 8641 


0.07 % 


" 4 | 


17 | 


1 124 


1 SEQ ID NO: 8642 


0.07 % 


4 1 


18 | 


1 1 


1 SEQ ID NO: 8643 


0.03 % 


2 


19 ! 


1 120 


I SEQ ID NO: 8644 


0.03 % 


1.8 


20 | 


| 11 I SEQ ID NO: 8645 I 0.02 % 


1.2 




HLA B7 - 10 mere 


Maximum possible score using this molecule type 


5400 


I Rank | Start position 


Sequence 


L % of max. score 


Score 


1 1 II 66 


SEQ ID NO: 8646 


1.48 % 


80 


1 2 II 123 


SEQ ID NO: 8647 


I 0.74 % 


40 


1 3 II 20 


SEQ ID NO: 8648 


! 0.37 % 


20 


1 4 B 64 


SEQ ID NO: 8649 


I 0.22 % 


12 


1 5 II 119 


SEQ ID NO: 8650 


i 0.11 % 


6 


1 6 II 54 


SEQ ID NO: 8651 


j 0.09 % 


5 


1 7 1 19 


SEQ ID NO: 8652 


! 0.07 % 


4 
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8 


36 1 SEQ ID NO: 8653 


0.07 % 1 


4 


9 


47 


SFO ID NO" 8654 


0.07 % I 


4 


1 io 


52 


SFO ID NO' 8655 


0.07 % 


4 


11 




^FO TD NO* Rfi56 


0.07 % | 


4 


12 


70 


^FO TD NO* Rfi57 


0.07 % 


4 


13 


7*3 


^fh th wo- Rfi^R 


0.07 % 


4 


14 


78 
/ o 


^FO TO NO- Rfi^Q 

•JLy 1U IMU a OUJ7 


0.07 % 


4 


15 


OJ 




0.07 % | 


4 


16 


86 


SEQ ID NO: 8661 


0.07 % 


4 


17 


101 


SEQ ID NO: 8662 


1 0.07 % 


4 


18 


104 


SEQ ID NO: 8663 


0.07 % | 


4 


1 19 


105 


SEQ ID NO: 8664 


0.07 % 


4 


1 20 


15 


SEQ ID NO: 8665 


0.03 % 


2 


Table 19: Epitopes for SEQ ID NO: 6045 


HLA Al - 9 mers 


Maximum possible score using this molecule type 


5625 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


4 


SEQ ID NO: 8666 


0.02 % 


1.35 


2 


66 


SEQ ID NO: 8667 


0.02 % 


1.35 


3 


33 


SEQ ID NO: 8668 


0.02 % 


1.25 


4 


44 


SEQ ID NO: 8669 


0.01 % 


1 j 


5 


50 


SEQ ID NO: 8670 


0.01 % 


1 


6 


14 


SEQ ID NO: 8671 


0.01 % 


0.75 


7 


48 


SEQ ID NO: 8672 


0.01 % 


0.75 


8 | 11 


SEQ ID NO: 8673 


0.00 % 


0.5 



HLA Al - 10 mers 1 


Maximum possible score using this molecule type 


5625 


Rank 


Start position 


! Sequence 


% of max. score 


Score 


1 1 


4 


SEQ ID NO: 8674 


0.12 % 


6.75 


2 


66 


SEQ ID NO: 8675 


0.12 % 


6.75 


3 


10 


SEQ ID NO: 8676 


0.00 % 


0.5 


4 


28 


SEQ ID NO: 8677 


0.00 % 


0.5 


5 


32 


SEQ ID NO: 8678 


i 0.00 % 


0.5 


6 


47 


SEQ ID NO: 8679 


1 0.00 % 


0.5 



5 
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HLA A3 - 9 mers 


Maximum possible score using this molecule type 




KanK 


9LarC pOalillin 


Con i iflnro 


/o ot max* score 


score 


1 


JL / 


OCl^ LU IMVJ. OOOU 


U.Z4 vo 




Z 


AA 


ccn TO MH- 

DCl^ 1U IMU. OOOx 


n n~7 o/ 
U.U/ vo 


Q 




1 a 

iy 


oty 1U IMU. ouoZ 


n nc o/ 
U.UO vo 


Q 1 

. O.l 


4 


en 


ccn to MO- QCQO 
o£U 1U IMU. ouoj 


U.U4 vo 


C A 

3.4 


I C 


zy 


1 CCO TO MO« Q£Q/I 


n no o/ 
U.UJ vo 


A 


a 
O 


52 


1 CCO TO MO« OCOC 

[ btQ ID IMU: obo5 


0.02 % 


J.Z4 


7 


54 


1 SEO ID NO" 8686 


0.02 % 


O i 


8 


L n 


1 SEQ ID NO: 8687 


0.01 % 


1.8 


9 


37 


1 SEQ ID NO: 8688 • 


0.01 % 


1.8 


10 


25 


SEQ ID NO: 8689 


0.01 % 


1.35 


11 


10 1 


SEQ ID NO: 8690 


0.00 % 


0.9 


12 


16 | 


SEQ ID NO: 8691 


0.00 % 


0.9 


13 


35 | 


SEQ ID NO: 8692 


1 0.00 % I 0.6 



HLA A3 - 10 mere 


Maximum possible score using this molecule type 


I 12150 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


49 


SEQ ID NO: 8693 


0.44 % 


54 


2 


17 


SEQ ID NO: 8694 


0.22 % 


27 


' 3 


10 


SEQ ID NO: 8695 


0.14 % 


18 


4 


16 


SEQ ID NO: 8696 


0.07 % 


9 


5 


32- 


SEQ ID NO: 8697 


0.04 % 


6 


6 


19 


SEQ ID NO: 8698 


0.01 % 


1.8 


7 


29 


SEQ ID NO: 8699 


0.00 % 


1.2 


8 


23 


SEQ ID NO: 8700 


0.00 % 


0.9 


9 


26 


I SEQ ID NO: 8701 


0.00 % | 0.9 



HLA A24 - 9 mere 


Maximum possible score using this molecule type 


1596.672 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


18 


SEQ ID NO: 8702 


1.87 % 


30 


2 


24 


SEQ ID NO: 8703 


0.65 % 


10.5 


3 


9 


SEQ ID NO: 8704 


0.52 % 


8.4 


4 


12 


SEQ ID NO: 8705 


0.52 % 


8.4 


5 


28 


SEQ ID NO: 8706 


0.52 % 


8.4 


6 


42 1 


SEQ ID NO: 8707 


0.52 % 


8.4 
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7 | 57 


SEO ID NO 


: 8708 


0.52 % II 8.4 


8 


66 


SEQ ID NO 


: 8709 


0.52 % II 8.4 


9 


55 


SEQ ID NO 


: 8710 


0.51 % 1 8.25 


1 io H o 


SEQ ID NO 


: 8711 


0.48 % II 7 7 


1 11 8 22 


SEQ ID NO 


: 8712 


0.45 % II 7 2 


1 12 || 10 


SEQ ID NO 


: 8713 


0.37 % nr 6 


13 I 25 


SEO ID NO: 


: 8714 


0 37 % ~~ IT" 6 


14 


I 30 


SEO ID NO: 


• 8715 


0 37 % 1 6 


15 


1 19 . 


SEO ID NO: 


8716 


0.35% IF 5.6 


16 


1 40 


SEO ID NO: 


8717 


0.31 % II 5 


17 1 


1 3 


SEQ ID NO: 


8718 


0.30 % j 4.8 


18 


1 65 


SEQ ID NO: 


8719 


0.30 % II 4.8 


19 H 14 


SEQ ID NO: 


8720 


0.27 % 


4.32 


20 || 56 


SEQ ID NO: 


8721 


0.25 % 


4 




1 HLA A24 - 10 mere 


iMaximum possible score using this molecule type '; || 1596.672 


IE5GI5 


Start position 


Sequence 


1 % of max. score || Score 


m 


1 55 


SEQ ID NO: 8722 


! 18.78% I 300 


2 ! 


1 18 | 


SEQ ID NO: 8723 I 


S 2.63% \ U 42 


3 I 


1 21 I 


SEQ ID NO: 8724 | 


1 2.25% || 36 


4 | 


1 2 


SEQ ID NO: 8725 I 


1 1.87% : || 30 


5 I 


1 24 | 


SEQ ID NO: 8726 II 1.87 % " : I 


1 30 


6 I 


n 1 


SEQ ID NO: 8727 I 


I 0.52% '.. || 8.4 


7 40 


SEQ ID NO: 8728 1 


1 0.52% S 8.4 


8 || 65 


SEQ ID NO: 8729 I 


1 0.42% || 6.72 


9 || 9 


SEQ ID NO: 8730 II 0.37 % | 6 


10 | 


8 


SEQ ID NO: 8731 | 0.35 % 1 5.6 


1 11 1 


27 


SEQ ID NO: 8732 || 0.35 % || 5.6 


12 | 


1 41 


SEQ ID NO: 8733 


0.35 % I 5.6 


13 II 57 


SEQ ID NO: 8734 


0.31 % 1 5 


14 || 17 


SEQ ID NO: 8735 


0.25 % ir 4 


15 II 29 


SEQ ID NO: 8736 


0.25 % II 4 


16 I 64 


SEQ ID NO: 8737 


0.25 % i 


1 4 


17 || 16 


SEQ ID NO: 8738 


0.22 % | 


! 3.6 


18 || 10 || SEQ ID NO: 8739 


0.18 % | 


1 3 


19 | 13 1 SEQ ID NO: 8740 


0.18 % i 


S 2.88 


20 || 23 1 SEQ ID NO: 8741 


0.08 % II 1.4 



-440- 



WO 2004/092360 



PCT/US2004/011710 



| HLA A 0201 - 9 mers 



iMaximum Dossible score using this molecule type 






Start position ll Sequence 


% of max. score 


Score 


1 


19 H SEQ ID NO: 8742 


0.03 % 


IJlU.ooZjlJo 


2 


! 15 II SEQ ID NO: 8743 


I 0.02 % 


1082.4143022 


3 


16 II SEQ ID NO: 8744 1| 0.02 % 


1040-33238624 


4 


1 49 II SEQ ID NO: 8745 1| 0.00 % 


382.536 


5 II 25 II SEQ ID NO: 8746 II 0.00 % 


342.863529264 


6 || 56 II SEO ID NO: 8747 II 0.00 % 


63.28397376 


7 | 12 II SEO ID NO: 8748 II 0.00 % 


40.19736105 


8 j 10 H SEQ ID NO: 8749 II 0.00 % 


21.3624 


9 || 22 II SEQ ID NO: 8750 II 0.00 % 


19.7762418 


10 H 26 II SEO ID NO: 8751 II 0.00 % 


12.6684 


11 || 20 II SEQ ID NO: 8752 II 0.00% - 


11.544666 


12 


I 37 I SEQ ID NO: 8753 | 0.00 % 


10.4328 


13 


1 32 | SEQ ID NO: 8754 || 0.00 % 


8.4456 j 


14 


J 23 B SEQ ID NO: 8755 II 0.00 % 


6.2888049 


15 


47 1 SEQ ID NO: 8756 1| 0.00 % 


6 0858 


16 


1 3 • I SEQ ID NO: 8757 1| 0.00 % 


1 4.582929078 


17 II 18 


I SEQ ID NO: 8758 H 0.00 % 


1 4.4855150505 


18 8 28 


1 SEQ ID NO: 8759 1 0.00 % 


1 4.2923589 


19 II 62 1 SEQ ID NO: 8760 || 0.00 % 


I 2.88098391 


20 


| 27 II SEQ ID NO: 8761 II 0.00 % 


I 1.699677 



HLA A 0201 - 10 mers 


Maximum possible score using this molecule type 


II 3925227.1 


Rank 


1 Start position || Sequence II % of max. score 


H - Score 


1 


1 


17 


II SEQ ID NO: 8762 1| 


0.16 % 


II 6459.14167272 


2 


1 


19 


H SEQ ID NO: 8763 II 


0.01 % 


607.88448 


3 


1 


25 


II SEQ ID NO: 8764 It 


0.00 % 


1 126.83304 


4 




11 


II SEQ ID NO: 8765 \ 


0.00 % 


II 63.16728165 I 


5 




15 


i SEQ ID NO: 8766 H 


0.00 % 


j 53.54651988 


6 


1 


37 


II SEO ID NO: 8767 II 


0.00 % 


1 28.51632 i 


7 


II 


14 


II SEQ ID NO: 8768 1| 


0.00 % 


II 21.8247414 | 


8 


II 


29 


II SEO ID NO: 8769 H 


0.00 % 


II 21.3624 S 


9 




26 


II SEQ ID NO: 8770 1| 


0.00 % 


II 19.42488 


10 


II 


3 


11 SEO ID NO: 8771 1| 


0.00 % 


II 17.2167282 


11 




48 


II SEQ ID NO: 8772 1| 


0.00 % 


II 15.7068219 


12 


II 


12 


II SEQ ID NO: 8773 I 


0.00 % 


II 9.8581266 
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13 


27 


SEQ ID NO: 8774 


0.00 % 


7.3086111 


14 


39 


SEQ ID NO: 8775 


r 0.00 % 


7.10976 


15 


23 


SEQ ID NO: 8776 


0.00 % 


5.7419523 


16 


22 | 


SEQ ID NO: 8777 


0.00 % 


4.599126 


17 


45 | 


SEQ ID NO: 8778 


0.00 % 


2.5495155 


18 


31 1 


SEQ ID NO: 8779 


0.00 % 


2.52747 


19 


52 1 


SEQ ID NO: 8780 


0.00 % 


2.383605 


20 


20 


SEQ ID NO: 8781 


0.00 % 


2.332847151 



HLA A 1101 - 9 mere 


Maximum possible score using this molecule type 


36 


Rank II Start position 


Sequence 


% of max. score 


Score 


1 1 44 


SEQ ID NO: 8782 


3.33 % 


1.2 



HLA A 1101 - 10 mere 


Maximum possible score using this molecule type 


36 


Rank || Start position j 


Sequence H % of max. score 


Score 



HLA B7 - 9 mers 


Maximum possible score using this molecule type ~ | 


1 5400 


Rank | 


Start position | 


| Sequence 


% of max. score | 


| Score 


1 1 


3 


SEQ ID NO: 8783 


0.37 % 


I 20 


2 I 


12 


SEQ ID NO: 8784 


0.37 % 


1 20 


3 I 


22 


SEQ ID NO: 8785 


0.37 % 


! 20 


4 I 


56 


SEQ ID NO: 8786 1 


0.37 % | 


! 20 


5 | 


30 


SEQ ID NO: 8787 | 


0.22 % 


1 12 


_6 | 


: 9 


SEQ ID NO: 8788 1 


0.07 % 


1 4 


7 | 


10 


SEQ ID NO: 8789 • 


0.07 % ! 


! 4 


8 I 


19 


SEQ ID NO: 8790 1 


0.07 % ! 


1 4 


9 I 


; 25 


SEQ ID NO: 8791 I 


0.07 % | 


1 4 


10 1 


1 28 


SEQ ID NO: 8792 1 


0.07 % 


1 4 


U 1 


42 


SEQ ID NO: 8793 I 


0.07 % ! 


1 4 


12 I 


65 


SEQ ID NO: 8794 1 


! 0.07 % 1 


1 4 


13 | 


1 35 


SEQ ID NO: 8795 1 


I 0.05 % 


1 3 


14 | 


! 66 


SEQ ID NO: 8796 I 


I 0.02 % 


I 1.2 


15 I 


1 15 


SEQ ID NO: 8797 | 


! 0.01 % 


1 1 


16 | 


1 47 


SEQ ID NO: 8798 || 0.01 % 1 1 


17 | 


i 20 


I SEQ ID NO: 8799 I 


I 0.01 % | 


I 0.6 


18 | 


| 23 


1 SEO ID NO: 8800 


1 o.oo % ! 


i 0.5 
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[ 19 j 27 "I SEP ID NO: 8801 11 0.00 % II O.sZj 



HLA B7 - 10 mere 


Maximum possible score usinq this molecule type | 




■ - 1 

Rank 


start position 


sequence | 


Yfo ot max. score | 


Score 


i 


27 


CCA Tp\ M/"> ■ QQA1 1 

SEQ ID NU. ooUz | 


0- 37 vo 1 




2 | 


= = 

8 ! 


SEQ ID NO: 8803 | 


0.07 % 1 


A 

4 


3 ! 


9 


SEQ ID NO: oo04 j 


0.07 % j 


4 


4 


11 


SEQ ID NO: 8805 | 


0.07 % 


4 


5 


17 


SEQ ID NO: 8806 


0.07 % 


4 


6 I 


29 


SEQ ID NO: 8807 | 


0.07 % _| 


i 4 


7 | 


41 


SEQ ID NO: 8808 \ 


0.07 % 


VI 
4 


8 


52 


SEQ ID NO: 8809 


0.07 % 


4 


9 


64 


SEQ ID NO: oolO j 


0.07 % I 


4 


10 1 


65 


SEQ_ID NO: 8811 | 


0.07 % 1 


4 


11 II 3 


SEQ ID NO: 8812 


A/ 1 

0.03 % 


2 


12 1 23 


SEQ ID NO: 8813 | 


0.03 % 


*> 
z 


13 1 21 


SEQ ID NO: 8814 | 


0.02 % | 


1.2 


14 | 15 


SEQ ID NO: 8815 | 


0.01 % : ! 


1 




35 


SEQ ID NO: 8816 I 


! 0.01 % I 


0.6 


16 


39 


SEQ ID NO: 8817 1 


o.oi % ! 


,0.6 


17 1 


12 


SEQ ID NO: 8818 I 


i 0.00 % | 


0.5 


18 | 


22 


SEQ ID NO: 8819 I 


0.00% 1 


0.5 


19 i 


45 


SEQ ID NO: 8820 | 


0.00 % 1 


. 0.5 


Table 20: Epitopes for SEQ ID NO: 6046 


HLA Al - 9 mere 


Maximum possible score using this molecule type 


5625 


Rank I 


Start position 


Sequence 


% of max. score 


Score 


l ! 


186 


SEQ ID NO: 8821 


2.22 % 


125 


2 


r 156 


SEQ ID NO: 8822 


I 0.88 % 


50 


3 | 14 


SEQ ID NO: 8823 


I 0.08 % 


4.5 


4 


o 


SEQ ID NO: 8824 


0.04 % 


2.5 


5 


29 


SEQ ID NO: 8825 


0.04 % 


2.5 


6 


85 


SEQ ID NO: 8826 


0.04 % 


2.5 


7 


168 


SEQ ID NO: 8827 


0.04 % 


2.5 


8 


133 


SEQ ID NO: 8828 


0.02 % 


1.35 
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9 


111 


SEO ID NO: 8829 


0.02 % 


1.125 


10 


61 


SEQ ID NO: 8830 


0.01 % 


1 


11 


7 


SEQ ID NO: 8831 


I 0.01 % 


0.9 


12 


131 


SEQ ID NO: 8832 

, , L» — — _ 1 


0.01 % 


0.9 


13 


211 


SEO ID NO: 8833 


0.01 % 


0.625 


14 


4 


SEQ ID NO: 8834 


0.00 % 


0.5 


15 


43 


SEQ ID NO: 8835 


0.00 % 


0.5 


16 


95 


SEQ ID NO: 8836 


0.00 % 


0.5 


17 


136 


SEQ ID NO: 8837 


1 0.00 % 


0.5 



HLA Al - 10 mers 


Maximum possible score using this molecule type 


cor 


Rank 


Start position 


Sequence 


% of max. score 


Score 


i 


133 . 


SEQ ID NO: 8838 | 


0.04 % 


2.7 


2 1 


84 


SEQ ID NO: 8839 1 


0.04 % 


2.5 


3 1 


167 


I SEQ ID NO: 8840 I 


0.04 % 


2.5 


4 


186 


SEQ ID NO: 8841 I 


0.04 % 


2.5 


5 


131 


SEQ ID NO: 8842 


0.04 % 


2.25 


6 I 


14 


SEQ ID NO: 8843 I 


0.03% 


1.8 


7 | 


205 


SEQ ID NO: 8844 


0.02 % 


1.25 


8 


111 


SEQ ID NO: 8845 


0.02 % 


1.125 


9 B 60 


SEQ ID NO: 8846 


0.01 % 


1 


• io 1 


188 


SEQ ID NO: 8847 


0.01 % 


0.75 




211 


SEQ ID NO: 8848 I 


0.01 % 


0.625 


12 


26 


SEQ ID NO: 8849 I 


0.00 % 


0.5 


13 


94 


SEQ ID NO: 8850 I 


0.00 % 


0.5 


14 


135 


SEQ ID NO: 8851 I 


0.00 % 


0.5 


15 I 168 


SEQ ID NO: 8852 ! 


0.00 % 


0.5 




HLA A3 - 9 mers 


Maximum possible score using this molecule type 


12150 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


43 


SEQ ID NO: 8853 


0.24 % 


30 


2 


90 


SEQ ID NO: 8854 


0.14 % 


18 


3 


148 


SEO ID NO: 8855 


0.09 % 


12 


4 


4 


SEQ ID NO: 8856 


0.05 % 


6.75 


5 


r 24 


SEO ID NO: 8857 


0.04 % 


6 


6 


19 


SEQ ID NO: 8858 


1 0.04 % 


5.4 


7 


136 


SEO ID NO: 8859 


0.04 % 


5.4 
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8 1 


54 


SEQ ID NO: 8860 R 0.03 % 


4.5 


Q ! 


32 


SEQ ID NO: 8861 II 0.03 % 


4 


10 1 


14 


ccn Tn MfV Rftfi? 

OCy IU INU. OOD£ 


U.UZ vO 


3.6 


11 1 

X X j 


59 


cert TT» MfV QfiA*3 


n no oa 


3.6 


12 1 

1c. j 


88 


CCO TP\ MO- QQCA 


n no ojl 
U.UZ vo 


3 


1 1 i 


87 


OC^l IU IMU. ooDD 


n no 0/ 
U.UZ vo 


2 7 


14 i 




Tl-V MO. oocc 

SEQ ID NO: oobo 


0.01 % 


1 8 


1 R I 


48 


SEQ ID NO: 8867 


0.01 % 


1 8 


16 1 


11C 


SEQ ID NO: 8868 


r 0.01 % 


1 8 
x.o 


17 | 


186 


SEQ ID NO: 8869 


0.01 % 


1.8 


18 | 


106 


SEQ ID NO: 8870 


0.01 % 


1.5 


19- I 


53 


SEQ ID NO: 8871 


0.01 % 


1.35 


20 


173 


SEQ ID NO: 8872 


0.00 % 


1.2 




HLA A3 - 10 mers 


Maximum possible score using this molecule type I 


| 12150 


Danv 

KdllK | 


Start position 


Sequence 


% of max. score j 


Score 


l 


24 


SEQ ID NO: 8873 


0.22 % 




2 


54 


SEQ ID NO: 8874 


0.18% 


22.5 


3 


135 


SEQ ID NO: 8875 


0.08 % 


10.8 


4 


51 


SEQ ID NO: 8876 


0.07 % 


9 


5 


13 


SEQ ID NO: 8877 


0.06 % ■ 


8.1 


6 


26 


SEQ ID NO: 8878 


0.04 % 


6 


7 


31 


SEQ ID NO: 8879 | 0.04 % , 


6 


8 


90 


SEQ ID NO: 8880 8 0.04 % 


6 


9 


43 


SEQ ID NO: 8881 


0.03 % 


4.5 


10 


19 


SEQ ID NO: 8882 


0.03 % 


4.05 


11 


169 


SEQ ID NO: 8883 


0.02 % 


3 


12 


87 II SEQ ID NO: 8884 


0.02 % 


2.7 


13 


1 84 || SEQ ID NO: 8885 


0.01 % 


1.8 


14 


i 88 ! SEQ ID NO: 8886 


0.01 % 


1.8 


15 


94 j SEQ ID NO: 8887 


0.01 % 


1.8 


16 


64 H SEQ ID NO: 8888 


0.00 % 


1.2 


17 


131 \ SEQ ID NO: 8889 


0.00 % 


1.2 


18 


99 II SEQ ID NO: 8890 


0.00 % 


1 


19 


53 II SEQ ID NO: 8891 


0.00 % 


0.9 


20 


85 II SEQ ID NO: 8892 


0.00 % 


0.9 



HLA A24 - 9 mers 
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iMaximum possible score using this molecule type 


1596.672 


Rank 

IxalllV 


Start position H Sequence 


1 *Va of max. score 


Score 


1 
X 


196 


SEO TD NO: 8893 


! 27 55 % 


440 


2 


44 

r r 


SEO TD NO: 8894 


1 18 78 % 


L 300 




JO 


^FO Tn NO - RRQ5 

JLy XL/ lvVJ> 007J 


1 12 52 % 


200 


A 




<?FO TH NO' RRQfi 


1 1 9 ^9 % 


200 


c 

D 


1 no 


^FO Tn NO - RRQ7 

JLy XL/ iMV/a 00-7/ 


I ? 70 % 


43.2 


a 

U 


ZD 


^FH TH MO* RRQR 

OCy XL/ i>lv/. 0070 


I 1 R7 % 
I X.O/ /o 


[ 30 


/ 




SEQ ID NO: 8899 


I 1 1 9 o/ ft 
1 1.1^ /o 


18 


Q 
O 


1 


SEQ ID NO: 8900 


r 0.75 % 


12 


Q 




SEQ ID NO: 8901 


0.70 % 


11.2 




/ 


SEQ ID NO: 8902 


0.64 % 


10.368 


1 1 

J. X 




SEQ ID NO: 8903 


0.52 % 


8.4 


xz 


1 


SEQ ID NO: 8904 


1 0.52 % 


8.4 


1 


1 Q"3 


SEQ ID NO: 8905 R 0.46 % 


7.5 


14 

X*T 


6 


SEQ ID NO: 8906 | 0.45 % 


7.2 


15 


19 


SEQ ID NO: 8907 


0.45 % 


7.2 


1 16 


110 


SEQ ID NO: 8908 


0.45 % 


7.2 


1 17 


114' 


SEQ ID NO: 8909 


J 0.45 % 


7.2 


1 18 


210 


SEQ ID NO: 8910 


I 0.45 % 


7.2 


1 19 


46 


SEQ ID NO: 8911 


I 0.42 % 


6.72 


I 20 


52 


SEQ ID NO: 8912 


I 0.37 % 


6 



HLA A24 - 10 mers 


Maximum possible score using this molecule type 


1596.672 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


92 


SEQ ID NO: 8913 1 


7.51 % 


120 


2 


42 


SEQ ID NO: 8914 1 


2.63 % 


42 


3 


109 


SEQ ID NO: 8915 I 


2.25 % 


36 


4 


23 


SEQ ID NO: 8916 


1.87 % 


30 


5 


34 


SEQ ID NO: 8917 


0.75 % 


12 


6 


6 


SEQ ID NO: 8918 


0.64 % 


10.368 


7 


45 


SEQ ID NO: 8919 


0.63 % 


10.08 


8 


196 


SEQ ID NO: 8920 


0.62 % 


10 


9 


44 


SEQ ID NO: 8921 


0.56 % 


9 


10 


40 


SEQ ID NO: 8922 


0.55 % 


8.8 


11 


62 


SEO ID NO: 8923 


0.46 % 


7.5 


12 


193 


SEO ID NO: 8924 


0.46 % 


7.5 


13 


18 


SEO ID NO: 8925 


0.45 % 


7.2 
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1 14 


113 


1 SEQ ID NO: 8926 1 


0.45 % 


1 7.2 


15 


56 


| SEQ ID NO: 8927 


0.37 % 


II 6 


16 


176 


fl SEQ ID NO: 8928 


0.37 % 


1 6 


17 


16 


1 SEQ ID NO: 8929 


0.35 % 


\ 5.6 


18 


138 


1 SEQ ID NO: 8930 


0.35 % 


1 5.6 


19 


127 


II SEQ ID NO: 8931 


0.33 % 


1 5.28 


20 


36 


I SEQ ID NO: 8932 


0.31 % 


II 5 



HLA A 0201 - 9 mere 


Maximum possible score using this molecule type 


3925227.1 


RankH Start position 


1 Sequence I % of max. score 


Score 


1 II 13 || SEQ ID NO: 8933 U 0.04 % 


1793.676528 


2 || 87 | SEQ ID NO: 8934 II 0.03% 


1415.3832 


3 || 24 H SEQ ID NO: 8935 1 0.01 % 


618.0996816 


4 || 19 II SEQ ID NO: 8936 H 0.00% 


223.23708 


5 || 12 


SEQ ID NO: 8937 8 0.00 % 


210.36400875 


6 || . 51 


SEQ ID NO: 8938 S 0.00 % 


198.30859992 


1 7 H .53 


SEQ ID NO: 8939 U 0.00 % 


194.477328 


(81 88 


SEQ ID NO: 8940 II 0.00 % 


180.58536756 


9 | 106 


SEQ ID NO: 8941 II 0.00 % 


169.74828 


10 II 54 


SEQ ID NO: 8942 II 0.00 % 


70.09848 


11 II 59 


SEQ ID NO: 8943 II 0.00 % 


43.42032 


12 || 94 


SEQ ID NO: 8944 II 0.00 % 


41.792058 


13 | 20 


SEQ ID NO: 8945 


1 0.00 % 


37.46088108 


14 || 63 


SEQ ID NO: 8946 II 0.00 % 


35.73520902 


15 H 22 


SEQ ID NO: 8947 M 0.00 % 


20.5916435109 


16 H 47 


SEQ ID NO: 8948 II 0.00 % 


12.233222865 


17 || 66 


SEQ ID NO: 8949 B 0.00 % 


12.2199 


18 || 56 


SEQ ID NO: 8950 II 0.00 % 


11.486706 


19 II 67 


SEQ ID NO: 8951 


0.00 % 


6.416172 


1 20 II 117 


SEQ ID NO: 8952 


0.00 % 


5.827464 






HLA A 0201 - 10 mere | 


Maxim 


urn possible score using this molecule type I 


3925227.1 I 


Rank 


Start position 


Sequence 


% of max. score I 


Score I 


1 


43 


SEQ ID NO: 8953 


0.10 % 


3977.8497792 I 


2 


24 


SEQ ID NO: 8954 


0.02% || 836.2525104 \ 


3 


51 


SEQJD NO: 8955 


0.02% || 815.616432 I 


4 


49 


SEQ ID NO: 8956 


0.01 % | 660.3245145 I 
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5 


19 


ISEO ID NO: 8957 


0.00 % 


251.837856 


6 


59 


ISEO ID NO: 8958 


0.00 % 


1 159.9696 


7 


12 


ISEO ID NO: 8959 


0.00 % 


155.245377 


a 1 


45 


ISEO ID NO* 8960 


0.00 % 


141.1974531 


9 1 
^ I 


21 


ISEO ID NO: 8961 


0.00 % 


117 22672269 


10 1 

XU j 


53 


ISEO ID NO: 8962 


0.00 % 


84.55536 


1 1 I 


87 


ISEO ID NO* 8963 


0 00 % 


65 5671672 


1 9 I 

JL Z. ] 




I^FO Tn NO* 8Q64 


o on % 

V/»UU /O 


fi4 888R8fi1fi 


13 1 


1 53 


ISEO ID NO* 8965 


0 00 % 


49 13352 


14 I 


178 


ISEQ ID NO: 8966 


0 00 % 


26 087 


1 ^ I 


18 


ISEQ ID NO: 8967 


0 00 % 


24 80225Q6Q1 


16 | 


116 


ISEQ ID NO: 8968 


L o.oo % 


21.5616168 


17 I 


65 


ISEQ ID NO: 8969 


0.00 % 


20.77383 


18 | 


86 iSEQID NO: 8970 


0.00 % 


15.7068219 


19 | 


27 IISEQID NO: 8971 


0.00 % 


12.3159135 


1 20 ! 


46 iSEQID NO: 8972 


0.00 % 


11.45624789925 



HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 


36 


Rank 


Start position 


| Sequence 


% of max. score 


Score 


1 


4 B SEQ ID NO: 8973 


12.5 % 


4.5 


2 


136 


I SEQ ID NO: 8974 


3.33 % . 


1.2 


3 


156 


1 SEQ ID NO: 8975 


3.33 % 


1.2 


4 


140 


I SEQ ID NO: 8976 


1.66 % 


0.6 



HLA A 1101 - 10 mers I 


Maximum possible score using this molecule type 


• 36 I 


Rank 


Start position 


Sequence 


% of max. score 


Score | 


1 


169 


SEQ ID NO: 8977 


5.55 % 


2 I 


2 


94 


SEQ ID NO: 8978 B 3.33 % 


1.2 | 




| HLA B7 - 9 mers 1 


iMaximum possible score using this molecule type 


5400 


Rank || Start position 


Sequence 


% of max. score 


Score 


1 1 II 146 


SEO ID NO: 8979 


0.74 % 


40 


1 2 II 154 


SEQ ID NO: 8980 


0.74 % 


40 


3 II 80 


SEQ ID NO: 8981 


0.66 % 


I 36 


4 || 139 | 


SEQ ID NO: 8982 


0.33 % 


1 18 


5 II 83 | 


SEQ ID NO: 8983 


0.22 % 


12 
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6 


209 


SEQ ID NO: 8984 


0.22% | 12 i 


7 


7 


SEQ ID NO: 8985 


0.11 % j 6 


8 


3 


SEQ ID NO: 8986 


0.07 % | 4 


9 


6 


SEQ ID NO: 8987 


0.07 % || 4 


10 


12 


SEQ ID NO: 8988 


0.07 % | 4 


11 


19 


SEQ ID NO: 8989 


0.07 % If 4 


12 


24 


SEQ ID NO: 8990 


0 07 % II 4 


13 


38 


SEO ID NO: 8991 


0 07 % II 4 


14 


46 1 


SEO ID NO: 8992 


0 07 % U 4 


15 


56 


SEQ ID NO: 8993 


1 0.07 % H 4 


16 


110 


SEQ ID NO: 8994 


1 0.07 % 1 4 


17 


114 


SEQ ID NO: 8995 


0.07 % || 4 


18 


123 


SEQ ID NO: 8996 


0.07 % II 4 


19 


129 


SEQ ID NO: 8997 


0.07 % 11 4 


1 20 


! 166 


SEQ ID NO: 8998 


0.07 % H 4 




I HLA B7 - 10 mers 


jMaximum possible score using this molecule type 


5400 


Rank || Start position 


Sequence , 


; % of max. score 


Score 


1 i 56 


SEQ ID NO: 8999 


1.48 % 


80 


2 j 40 


SEQ ID NO: 9000 


0.74 % 


40 


3 | 127 


SEQ ID NO: 9001 


0.74 % 


40 


4 | 170 


SEQ ID NO: 9002 


0.74 % 


40 


5 


1 140 


SEQ ID NO: 9003 


0.27 % 


15 


6 


1 35 I 


SEQ ID NO: 9004 


0.22 % 


12 


7 


1 79 


SEQ ID NO: 9005 


0.22 % 


12 


8 


1 82 ! 


SEQ ID NO: 9006 


0.22 % 


12 


9 


208 | 


SEQ ID NO: 9007 


0.22 % 


12 


10 


! 209 


SEQ ID NO: 9008 


0.22 % 


12 


11 


| 80 


SEQ ID NO: 9009 


0.16 % 


9 


12 


1 129 


1 SEQ ID NO: 9010 


0.14 % 


8 


13 


! 138 


I SEQ ID NO: 9011 


0.11 % 


6 


14 


1 73 


I SEQ ID NO: 9012 


0.09 % 


5 


15 


1 2 


SEQ ID NO: 9013 


r ™ 0.07 % 


4 


16 


1 5 


SEQ ID NO: 9014 


0.07 % 


4 


17 


! 6 


SEQ ID NO: 9015 


0.07 % 


4 


18 


1 16 


SEQ ID NO: 9016 


0.07 % IS 4 


19 


1 18 


SEQ ID NO: 9017 


0.07 % 


L 4 


20 


1 24 


SEQ ID NO: 9018 


0.07 % 


4 



-449- 



WO 2004/092360 PCT/US2004/011710 



Table 21: Epitopes for SEQ ID NO: 6047 


HLA Al - 9 mers 


Maximum possible score using this molecule type 


1 5625 


Rank 


Start position 


Sequence 


% of max. score 


1 Score 


1 


53 


SEQ ID NO: 9019 


2 % 


I 112.5 


2 


10 


SEQ ID NO: 9020 


0.08 % 


1 4.5 


3 


33 


SEQ ID NO: 9021 


0.02 % 


! 1-5 


4 


3 


SEQ ID NO: 9022 


0.00 % 


1 0.5 


5 


27 I 


! SEQ ID NO: 9023 


0.00 % 


! 0.5 


6 


29 


I SEQ ID NO: 9024 


0.00 % 


1 0.5 




HLA Al - 10 mers 


Maximum possible score using Jthis molecule type 


5625 


Rank 


Start position 


Sequence 9 % of max. score 


Score 


1 


10 


SEQ ID NO: 9025 


I 0.8% 


45 


2 


52 


SEQ ID NO: 9026 


0.2 % 


11.25 


3 


50 


SEQ ID NO: 9027 


0.04 % 


2.5 


4 


32 


SEQ ID NO: 9028 


0.02 % 


1.5 


5 


48 


SEQ ID NO: 9029 


0.02 % 


1.35 


6 


27 | SEQ ID NO: 9030 


0.00 % 


0.5 




HLA A3 -9 mers 1 


Maximum possible score using this molecule type 


12150 | 


Rank || Start position II Sequence 


I % of max. score 


Score I 


1 II 38 


SEQ ID NO: 9031 


! 1.85 % 


225 | 


2 


1 17 


SEQ ID NO: 9032 


0.02 % 


3.6 


3 




SEQ ID NO: 9033 


I 0.02 % 


2.7 


4 


1 37 


SEQ ID NO: 9034 | 0.01 % 


1.8 


5 II 27 


SEQ ID NO: 9035 || 0.01 % H 1.35 


6 II 13 


SEQ ID NO: 9036 i 0.00 % 


0.675 


7 i 14. 


SEQ ID NO: 9037 | 0.00 % 


0.6 



HLA A3 - 10 mers 


Maximum possible score using this molecule type 




H 12150 


Rank 


Start position 


Sequence 




% of max. score 


|| Score 


1 


13 


SEQ ID NO: 9038 




0.04 % 


1 6 


2 


37 


SEQ ID NO: 9039 




0.01 % 


1 2.025 
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3 


2 


SEQ ID NO: 9040 


0.00 % 


0.9 


4 


19 


SEQ ID NO: 9041 


0.00 % 


0.675 


5 


16 


SEQ ID NO: 9042 


0.00 % 


0.54 



HLA A24 - 9 mere 


Maximum possible score using this molecule ty 




lbyo.b/2 


Rank 


start position 


sequence 


^1 9 — mWA± C Mlnm^ A*+ At* At^. MS a*^. 

u /o or max. score 


Score 


1 


on 

20 


CCA jr\ MO • QA/IO 


1.2b vo 


2U 


2 


1 6 


SEQ ID NO: 9044 


U.b2 vo 




3 


5 


bnl^ID IMU: 9045 


0.51 % 


8.25 


A 

4 


| 35 


SEQ ID NO: 9046 


ft 1C w 

1 U.OD vo 


C VC 

b./o 


5 


1 -31 

I 31 


SEQ ID NO: 9047 


0.35 % 


i 5.b 


6 


1 43 


SEQ ID NO: 9048 


0.27 % 


4.4 


7 


I 13 


SEQ ID NO: 9049 


0.26 % 


4.2 


o 

8 


1 o*> 


SEQ ID NO: 9050 


0.21 % 


JJb 


9 


. 3 , 2 

1 2 


SEQ ID NO: 9051 


0.11 % 


i O 
l.O 




1 Q 
I 


SEQ ID NO: 9052 


0.10 % 


l.OO 


11 


8 


SEQ ID NO: 9053 


0.09% 




12 


15 


SEQ ID NO: 9054 


0.09 % 


1.5 


13 


23 


SEQ ID NO: 9055 | 


I 0.09 % 


1.5 


14 


27 


SEQ ID NO: 9056 || 0.08 % 


1.4 


15 


24 


SEQ ID NO: 9057 


0.07 % 


1.2 | 


16 


7 


SEQ ID NO: 9058 


0.06 % 


1 


17 


17 


SEQ ID NO: 9059 


0.06 % 


1 


18 


10 


SEQ ID NO: 9060 


0.05 % 


0.9 


19 


39 


SEQ ID NO: 9061 


0.04 % 


0.792 


20 


47 


SEQ ID NO: 9062 


I 0.04 % 


0.792 



HLA A24 - 10 mere 


Maximum possible score using this molecule ty 


pe 


1596.672 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


5 


SEQ ID NO: 9063 | 


2.63 % 


42 


2 


34 


SEQ ID NO: 9064 | 


0.54 % 


8.64 


3 


30 


SEQ ID NO: 9065 | 


0.52 % 


8.4 


4 


19 


SEQ ID NO: 9066 [ 


0.50 % 


8 


5 


50 


SEQ ID NO: 9067 i 


0.33 % 


5.28 


6 


12 


SEQ ID NO: 9068 


0.26 % 


4.2 


7 


L 31 


SEQ ID NO: 9069 1 


0.21 % 


3.36 


8 


26 


SEQ ID NO: 9070 I 


0.15 % 


2.52 
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9 


1 8 


1 SEQ ID NO: 9071 


0.13 % 


1 2.1 | 


10 


22 


I SEQ ID NO: 9072 


0.12 % 


1 2 


11 


23 


11 SEQ ID NO: 9073 


0.11 % 


II 18 


12 


6 


I SEQ ID NO: 9074 


0.09 % 


I 15 


13 


14 


II SEO ID NO: 9075 


0.09 % 


II 15 


14 


16 


|| SEQ ID NO: 9076 


0 09 % 


II 15 


15 


7 


|| SEQ ID NO: 9077 


0 06 % 


B i 

H J. 


16 


48 


1 SEQ ID NO: 9078 


0.04 % 


II 0.75 


17 


0 


II SEQ ID NO: 9079 


0.04 % 


8 0.72 


18 


9 


II SEQ ID NO: 9080 


0.04 % 


II 0.72 


19 


47 


1 SEQ ID NO: 9081 


0.04 % 


1 0.66 


20 


39 


|| SEQ ID NO: 9082 


0.03 % 


II 0.6 



I HLA A 0201 - 9 mere 


iMaximum possible score using this molecule type 


. 3925227.1 


Rank 


Start position 


Sequence 


I % of max. score 


j Score 






SEQ ID NO: 9083 


0.00 % 


14.1442686 


2 


1 27 


SEQ ID NO: 9084 


I 0.00 % 


9.598176 


3 | 22 


SEQ ID NO: 9085 1 0.00 % 


9.5634 


4 || 9 


SEQ ID NO: 9086 || 0.00 % 


5.546246013 


5 B 2 


SEQ ID NO: 9087 || 0.00 % 


5.526462816 


6 || 24 


SEQ ID NO: 9088 j 0.00 % 


4.88163753 


7 | 17 


SEQ ID NO: 9089 II 0.00 % 


3.699285408 


8 


1 31 


SEQ ID NO: 9090 II 0.00 % 


2.29699206 


-4-J § 


SEQ ID NO: 9091 1 0.00 % 


2.0016040674 


io n 7 


SEQ ID NO: 9092 II 0.00 % 


0.91287 


11 II 49 


SEQ ID NO: 9093 II 0.00 % 


0.71805678 


12 || 16 


SEQ ID NO: 9094 1 0.00 % 


0.6694257042 


13 II 12 


SEQ ID NO: 9095 1 0.00 % 


0.6539828625 




HLA A 0201 - 10 mers 


Maximum possible score using this molecule type 


3925227.1 \ 


Rank 


Start position 


Sequence | 


% of max. score 


Score 


1 


16 


SEQ ID NO: 9096 1 


0.00 % 


34.28765802 


2 


19 


SEQ ID NO: 9097 1 


0.00 % 


18.9368775 


3 


14 


SEQ ID NO: 9098 1 


0.00 % 


14.1442686 


4 


27 


SEQ ID NO: 9099 1 


0.00 % 


11.406528 


5 


26 


SEQ ID NO: 9100 1 


0.00 % 


10.9304361558 


6 


34 


SEQ ID NO: 9101 1 


0.00 % 


5.580927 
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7 


6 


SEQ ID NO: 9102 


0.00 % 


4.865742 


8 


9 


SEQ ID NO: 9103 


0.00 % 


2.64106953 


9 


50 


SEO ID NO: 9104 


0.00 % 


2.6275752 


10 


30 


SEQ ID NO: 9105 


0.00 % 


2.29699206 


11 


7 


1 SEQ ID NO: 9106 


0.00 % | 0.86083641 


12 


42 


ISEQ ID NO: 9107 


0.00 % 


0.7049592 


13 


22 


SEQ ID NO: 9108 


0.00 % 


0.6628440357 


14 


2 


SEQ ID NO: 9109 


0.00 % 


0.6530644656 



HLA A 1101 - 9 mers 


Maximum possible score using this molecule type 


36 


1 Rank II Start position 


Sequence 


% of max. score 


Score 


1 i 37 


. SEQ ID NO: 9110 


15% 


5.4 


2 || 38 


SEQ ID NO: 9111 


2.22 % 


0.8 



HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


36 


Rank 


1 Start position I 


Sequence 


% of max. score 


Score 




1 37 | 


SEQ ID NO: 9112 


7.5% 


2.7 



HLA B7 - 9 mers 


Maximum possible score using this molecule type 


5400 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


35 


SEQ ID NO: 9113 


3.70 % 


200 


2 


17 


SEQ ID NO: 9114 


0.11 % 


6 


3 


6 


SEQ ID NO: 9115 


0.07 % 


4 \ 


4 


| 20 


SEQ ID NO: 9116 


0.07 % 


4 


5 


31 


SEQ ID NO: 9117 


0.07 % 


4 


6 


f 43 


SEQ ID NO: 9118 


0.07 % 


4 


7 


j 7 


SEQ ID NO: 9119 


0.03 % 


2 


8 


23 


SEQ ID NO: 9120 


0.02 % 


1.2 


9 


24 


SEQ ID NO: 9121 


0.02 % 


1.2 


10 


1 10 


SEQ ID NO: 9122 


0.01% | 0.9 




I HLA B7 - 10 mers 


iMaximum possible score using this molecule type 


5400 




1 Start position R Sequence 


I % of max. score 


Score 




1 35 


SEQ ID NO: 9123 


0.09 % 


5 


2 


19 


SEQ ID NO: 9124 


0.07 % 


4 
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\ 3 


30 


SEO ID NO: 9125 


0.07% II 4 1 


I 4 


34 


SEO ID NO: 9126 


n 07 % 

\J.\Jt /o 




5 


7 


SEO ID NO: 9127 


0.03 % 


2 I 


6 


16 


SEO ID NO: 9128 


0.03 % 


1.8 I 


7 


23 


SEO ID NO: 9129 


0.02 % 


1.2 | 


8 


50 


SEO ID NO: 9130 


I 0.02 % 


1.2 | 


9 


9 


SEO ID NO: 9131 


0.01 % 


1 1 


Table 22: Epitopes for SEQ ID NO: 6048 


HLA Al - 9 mers 


Maximum possible score using this molecule type H 5625 


Rank H Start position 


Sequence 


% of max. score \ Score 


1 


[ 66 


SEQ ID NO: 9132 


0.44 % | 25 


2 


80 


SEQ ID NO: 9133 


0.08 % 


1 5 | 


3 


\ 93 


SEQ ID NO: 9134 


0.04% | 


2.7 


4 


I 11 1 


SEQ ID NO: 9135 


0.04 % 


2.5 


5 


i 89 H SEQ ID NO: 9136 


0.04 % 


2.25 


6 0 48 II SEQ ID NO: 9137 


0.01 % 


1 


7 1 3 


I SEQ ID NO: 9138 


I 0.00 % 


0.5 


8 H 9 


I SEQ ID NO: 9139 


! 0.00 % 


0.5 


9 II 56 


1 SEQ ID NO: 9140 


0.00 % 


0.5 


10 II 101 B SEQ ID NO: 9141 


0.00 % 


0.5 


11 i; 106 II SEQ ID NO: 9142 


0.00 % 


0.5 


12 || 110 || SEQ ID NO: 9143 


0.00 % 


0.5 



HLA Al - 10 mers 


Maximum possible score using this molecule type 


1 5625 


Rank 


Start position 


Sequence 


% of max. score 


| Score 


1 


30 


SEQ ID NO: 9144 


0.4 % 


I 22.5 


2 


88 


SEQ ID NO: 9145 


0.12 % 


I 6.75 


3 


48 


SEQ ID NO: 9146 [ 


0.04 % 


1 2.5 


4 


55 


SEQ ID NO: 9147 I 


0.02 % 


I 1.25 


5 


13 


SEO ID NO: 9148 


j_ 0.01 % 


1 0.9 


1 6 


79 


SEQ ID NO: 9149 


0.01 % 


0.75 


1 7 


93 


SEO ID NO: 9150 


0.01 % 


0.675 | 


1 8 


2 


SEQ ID NO: 9151 


0.00 % 


0.5 I 


1 9 


8 


SEO ID NO: 9152 


0.00 % 


0.5 | 



-454- 



WO 2004/092360 



PCT/US2004/011710 



10 


65 ~l SEQ ID NO: 9153 


0.00% B 0.5 


11 


66 II SEQ ID NO: 9154 


0.00 % 


0.5 


12 


l 80 1 


SEQ ID NO: 9155 


0.00 % 


0.5 


13 


105 


SEQ ID NO: 9156 


0.00 % 


0.5 


14 


109 n 


SEQ ID NO: 9157 


f 0.00 % 


0.5 




HLA A3 - 9 mers 


Maximum possible score using this molecule type 


12150 


Rank 


I Start position 


Sequence fl % of max. score 


Score 


1 


j 109 


SEQ ID NO: 9158 I 0.74 % 


90 


2 


1 3 


SEQ ID NO: 9159 H 0.24 % 


| 30 


3 


I 111 | SEQ ID NO: 9160 || 0.12 % 


15 


4 


I 106 • j SEQ ID NO: 9161 M 0.07 % 


9 


5 || 95 || SEQ ID NO: 9162 H 0.05 % 


6.075 


6 II 101 II SEQ ID NO: 9163 B 0.04 % 


6 


7 H 110 II SEQ ID NO: 9164 || 0.02 % 


3.6 


8 U 84 II SEQ ID NO: 9165 || 0.02 % 


3 


9 1 80 


I SEQ ID NO: 9166 || 0.02 % 


2.7 


10 II 37 


1 SEQ ID NO: 9167 H 0.01 % 


2.25 


11 II 9 S 


I SEQ ID NO: 9168 I 0.01 % 


2 


12 || 54 | 


[ SEQ ID NO: 9169 H 0.01 % 


2 


13 1 99 | 


1 SEQ ID NO: 9170 || 0.01 % 


1.35 


14 1 1 1 


| SEQ ID NO: 9171 || 0.01 % 


1.215 


15 II 11 1 


| SEQ ID NO: 9172 || 0.00 % 


0.9 


16 II 15 I 


I SEQ ID NO: 9173 j 0.00 % 


0.9 


17 || 69 I 


! SEQ ID NO: 9174 B 0.00 % 


0.6 


18 II 5 ' SEQ ID NO: 9175 II 0.00 % 


0.54 


19 II 103 


I SEQ ID NO: 9176 I 


I 0.00 % 


0.54 




HLA A3 - 10 mers 


Maximum possible score using this molecule type II 12150 


Rank || Start position 


Sequence 


% of max. score II Score 


1 II 75 


SEQ ID NO: 9177 


0.49 % 1 60 


2 II 109 


SEQ ID NO: 9178 


0.29 % II 36 


3 B 22 


SEQ ID NO: 9179 


0.14% 1 18 ! 


4 II 15 


SEQ ID NO: 9180 


0.04 % H 6 


5 II 110 


SEQ ID NO: 9181 


0.01% II 2.25 


1 6 : ; 95 


SEQ ID NO: 9182 


0.01 % II 1.8 


7 || 101 


SEQ ID NO: 9183 


0.01 % II 1.35 
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8 


43 8 SEOIDNO 


' 9184 


0.00 % 1 


1 


9 


2 1 SEOIDNO 


: 9185 


0.00 % 


0.9 


10 


5 


SEO ID NO 


: 9186 


0.00 % 


0.9 


11 


7 


SEO ID NO 


: 9187 


0.00 % 


0.9 


12 


107 


SEO ID NO 


: 9188 


0.00 % B 0.9 


13 


102 


SEQ ID NO 


: 9189 


L 0.00% || 0.81 


14 


3 


SEO ID NO 


: 9190 


0.00% | 0.75 


15 


8 


SEQ ID NO 


: 9191 


0.00 % H 0.6 


16 


103 


SEQ ID NO 


: 9192 


0.00% || 0.54 



HLA A24 - 9 mers 


Maximum possible score using this molecule type 


1596.672 


Rank 


Start position 


Sequence _| 


°/o of max. score 


Score 


1 


88 


>bcQ ID NO: 9193 | 


1.66 % 


26.6112 


z 


— — 
// 


bkQ ID inU. 9194 1 


ft -j-j o/ 

U.// vo 


12.32 


•a 
o 


1 Q 


3tlj ID INU. 9193 1 


U.DO vo 


o 

% y 


4 


108 


SEO ID NO' 9196 


0 56 % 

\J m mj \J /U 


y 


5 


92 


SEQ ID NO: 9197 | 


0.54 % 


8 64 


6 


96 


SEQ ID NO: 9198 | 


0.54 % 


8.64 


7 


73 


SEQ ID NO: 9199 I 


0.46 % 


7.5 1 


8 


40 


SEQ ID NO: 9200 I 


0.45 % 


7.2 


9 


104 


SEQ ID NO: 9201 | 


0.42 % 


6.72 


10 


8 


SEQ ID NO: 9202 I 


0.41 % 


6.6 


11 


21 


SEQ ID NO: 9203 ! 


0.37 % 


6 


12 


102 


SEQ ID NO: 9204 I 


0.37 % 


6 


13 


22 


SEQ ID NO: 9205 I 


0.25 % 


4 


14 


68 


SEQ ID NO: 9206 I 


0.25 % 


4 


15 


106 


SEQ ID NO: 9207 I 


0.22 % 


3.6 


16 


1 


SEQ ID NO: 9208 I 


0.18 % 


3 


17 


79 


SEQ ID NO: 9209 I 


0.18 % 


3 


18 


93 


SEQ ID NO: 9210 I 


0.18 % 


3 


19 


101 


SEQ ID NO: 9211 I 


0.18 % 


3 


20 


37 


SEQ ID NO: 9212 I 


0.15 % 


2.4 



— ■ ■ ■- ■■ 




HLA A24 - 10 mers 




Maximum possible score using this molecule type 


1596.672 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


100 


SEQ ID NO: 9213 


0.93 % 


15 


2 


18 


SEQ ID NO: 9214 


0.78 % 


12.6 
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3 


98 


H SEO ID NO: 9215 1 


0.52 % 


1 


| 8.4 


4 


1 73 


II SEO ID NO: 9216 1 


0.46 % 


1 


1 7.5 


5 


1 91 


II SEO ID NO: 9217 | 


0.45 % 


1 


i 7.2 


6 I 103 


H SEO ID NO: 9218 1 


0.42 % 


1 


1 6.72 


7 


1 7 


II SEO ID NO: 9219 1 


0.41 % 


1 


1 6.6 


8 


i ->i 
I 21 


\ SEO ID NO: 9220 1 


0.37 % 


1 


1 6 


9 


( A £1 

1 46 


II SEO ID NO: 9221 1 


0.37 % 


H 


1 6 


10 


1 m 

1 93 


II SEO ID NO: 9222 1 


0.37 % 


1 


1 6 




1 96 


II SEO ID NO: 9223 1 


0.37 % 

Va* a *a» # * w 




1 6 


— ■ 
12 


1 mi 
I 101 


1 SEO ID NO" 9224 1 

II «JLa\^ XL/ llVl -7 «— • | 


0 37 % 




1 6 


13 


1 77 


II SEO ID NO" 9225 I 


0.25 % 

Va*.^— ^a* #U 


1 


1 4 


14 


1 92 


II SEO ID NO" 9226 1 


I 0.22 % 


1 


1 3.6 


15 




1 SEO ID NO" 9227 


0.22 % 


II 3.6 


16 




II SEO ID NO" 9228 

|| Whs^ JLLaT llWI «a»*a><to^/ 


0.18 % 




1 3 


17 


1 53 


1 SEQ ID NO: 9229 


0.18 % 




! 3 


18 


1 36 


SEQ ID NO: 9230 


0.12 % 




1 2 


19 


I 55 


SEQ ID NO: 9231 


0.12 % 




1 2 


20 


I 102 


SEQ ID NO: 9232 


0.11 % 




1 1.8 




HLA A 0201 - 9 mers | 


Maximum possible score using this molecule type 


3925227.1 I 


Rank] 


Start position 


Sequence II % of max. score 


Score | 


1 | 


84 


SEO ID NO: 9233 \ 


0.01 % 


441.342216 


2 | 


102 


SEQ ID NO: 9234 | 


0.00 % 


63.16728165 


3 I 


107 


SEO ID NO: 9235 \ 


0.00 % 


51.882640425 


4 I 


1 


SEQ ID NO: 9236 1 


0.00 % ! 


43.8816609 


5 I 


95 


SEQ ID NO: 9237 \ 


0.00 % 1 


33.40165248 


6 I 


I 2 


SEO ID NO: 9238 II 


0.00 % | 


24.66305226 


7 | 


! 92 


SEO ID NO: 9239 II 


0.00 % I 


22.64458905 


8 I 


| 103 


SEQ ID NO: 9240 II 


0.00 % [ 


20.70206586 


9 I 


1 47 


SEO ID NO: 9241 II 


0.00 % i 


11.175953184 


10 1 


I 94 


SEQ ID NO: 9242 B 


0.00 % I 


8.452983 I 


11 i 15 I 


SEQ ID NO: 9243 H 


0.00 % 


8.1793152 


12 


\ 8 II SEQ ID NO: 9244 || 


0.00 % 


4.993461 


13 


I 5 II SEQ ID NO: 9245 II 


0.00 % 


4.57284528 


14 


1 99 II 


SEQ ID NO: 9246 || 


0.00 % 


3.999468528 


15 


1 105 1 


SEQ ID NO: 9247 II 


0.00 % 


2.231322 1 


16 


1 20 1 


SEQ ID NO: 9248 || 


0.00 % 


1.3524 


17 1 62 II SEQ ID NO: 9249 || 


0.00 % 


0.8631693 
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18 


6 


SEO ID NO: 9250 


0.00 % 


0.824619 


19 


57 


SEO ID NO: 9251 


0.00 % 


0.72105 


20 


58 


SEO ID NO: 9252 


0.00 % 


0.7147572 




HLA A 0201 - 10 mere 


Maximum possible score using this molecule type 




Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


101 


SEQ ID NO: 9253 


0.03 % 


1243-078056 


2 


3 


SEQ ID NO: 9254 


0.01 % 


r"A"J C\ A A A ~\ 

592.944462 


3 


106 


SEQ ID NO: 9255 


0.00 % 


94.2678 


4 


5 


SEQ ID NO: 9256 


0.00 % 


A *3 A ino^ 

43.42032 


5 


107 


SEQ ID NO: 9257 \ 0.00 % 


33.30332334 


6 


102 


SEQ ID NO: 9258 | 0.00 % 


32. 531ol77o 


7 


54 | 


SEQ ID NO: 9259 


0.00 % 


27.324 


8 


7 


SEQ ID NO: 9260 


0.00 % 


21.3624 


9 


1 


SEQ ID NO: 9261 


0.00% 


13.723479 


1 n 




SEQ ID NO: 9262 


0.00 % 


13.00344192 


ii 


94 


SEQ ID. NO: 9263 


0.00 % 


If) 01276^8R 


12 


99 


SEQ ID NO: 9264 


0.00 % 


5.6615328 


13 


39 


SEQ ID NO: 9265 


1 0.00 % 


3.6304212 


14 


111 


SEQ ID NO: 9266 


I 0.00 % 


2.53368 


15 


103 


SEQ ID NO: 9267 


0.00 % 


2.475394803 


16 


14 


SEQ ID NO: 9268 


1 0.00 % 


2.4519012 


17 


19 


SEQ ID NO: 9269 


I 0.00 % 


2.07604992 


18 


29 


SEQ ID NO: 9270 


1 0.00 % 


1.8179154 


19 


57 


SEQ ID NO: 9271 


! o.oo % 


1.52076 


20 


47 


SEQ ID NO: 9272 


1 0.00 % 


1.27712376 



[| HLA A 1101 - 9 triers 



Maximum possible score using this molecule type 


36 




| Start position 


! Sequence 


% of max. score J 


Score 


i 


80 


SEQ ID NO: 9273 


3.33 % i 


1.2 


2 


69 


SEQ ID NO: 9274 


1.66 % 


0.6 


3 


109 


SEQ ID NO: 9275 


1.66 % 


0.6 



HLA A 1101 - 10 mers 


Maximum possible score using this molecule type 


I 36 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


22 


SEQ ID NO: 9276 


11.11 % 


4 
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HLA B7 - 9 mers 


Maximum possible score using this molecule type 


5400 


Rank 


Start oosition 


Seauence 


9/6 of max. score 


Score 


1 

X 


22 


SEO ID NO: 9277 1 


3 70 % 


200 


2 


77 


SEO ID NO- 9278 1 


2 22 % 


120 


3 


104 


SEO ID NO" 9279 

v/ L»V^ XL/ llV/i / ^ / mJ 


0 22 % 


12 

X ^ 


4 


40 


SEO ID NO* 9280 1 

ta/L.V^ IVY/* ^ £m\J\J j 


0 11 % 


6 


5 


ft 


1 SEO ID NO* 9281 1 


0 07 % 


4 


6 


21 

^.X 


1 SEO ID NO' 9282 1 


0 07 % 

\J m\J I ill 


4 


7 

# 


68 


1 SEO ID NO" 9283 1 


0 07 % 


4 


Q 


Q2 


1 ^FO TD NO- Q2R4 1 


0 07 % 


4 


g 


102 


1 SEO ID NO" 9285 1 

| wJL_V£ XL/ 1 «V/ • J m^\j *J ] 


0 07 % 


4 


10 


46 


SEO ID NO- 9286 1 


0 03 % 

\#«L/hJ /LI 


2 


11 


98 


SEQ ID NO: 9287 | 


0.03 % 


2 

4mm 


12 


103 


SEQ ID NO: 9288 I 


0.03 % 


2 


13 


88 


SEQ ID NO: 9289 ! 


0.02% 


1.2 


14 


105 


SEQ ID NO: 9290 I 


0.01 % 


0.9 


15 


43 


SEQ ID NO: 9291 1 


0.01 % 


0.6 


16 


79 


SEQ ID NO: 9292 ! 


0.01 % 


0.6 


17 


95 


SEQ ID NO: 9293 1 


0.01 % 


0.6 


18 


107 


SEQ ID NO: 9294 I 


0.00 % 


. 0.5 



HLA B7 - 10 mers 


Maximum possible score using this molecule type 


5400 | 


Rank 


Start position 


Sequence 


% of max. score 


Score | 


1 


46 


I SEQ ID NO: 9295 


1.48 % 


1 80 | 


2 


98 


I SEQ ID NO: 9296 


1.48 % 


1 80 | 


3 


91 


I SEQ ID NO: 9297 


0.37 % 


20 | 


4 


103 


I SEQ ID NO: 9298 


0.37 % 


20 I 


5 


7 


I SEQ ID NO: 9299 


0.07 % 


4 1 


6 


21 


I SEQ ID NO: 9300 


0.07 % 


4 1 


7 


101 


I SEQ ID NO: 9301 


0.07 % 


4 1 


8 


107 


I SEQ ID NO: 9302 


0.03 % 


2 ! 


9 


67 


1 SEQ ID NO: 9303 


0.02 % 




10 


93 


1 SEQ ID NO: 9304 


0.02 % 


1.2 


11 


69 


1 SEQ ID NO: 9305 


0.01 % 


1 1 


12 


39 


I SEQ ID NO: 9306 


0.01 % 


0.6 | 


13 


77 


I SEQ ID NO: 9307 


0.01 % 


0.6 | 



-459- 
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14 


22 


SEQ ID NO: 9308 


0.00 % 0.5 1 


Table 23: Epitopes for SEQ ID NO: 6049 


HLA Al - 9 mers 


Maximum possible score using this molecule type 


5625 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


0 


SEQ ID NO: 9309 j 


0.2 % 


11.25 


2 


35 


SEQ ID NO: 9310 I 


0.01 % 


0.9 


3 


4 


SEQ ID NO: 9311 I 


0.00 % 


0.5 


4 


5 


SEQ ID NO: 9312 I 


0.00 % 


. 0.5 


5 


10 


SEQ ID NO: 9313 I 


0.00 % 


0.5 


6 


1 19 


SEQ ID NO: 9314 S 


0.00 % 


0.5 


7 


21 


SEQ ID NO: 9315 ! 


0.00 % 


0.5 




HLA Al - 10 mers 


Maximum possible score using this molecule type 


5625 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


0 


SEQ ID NO: 9316 


0.2 % 


1.11.25 


2 


5 


SEQ ID NO: 9317 


0.04 % 


|. 2.5 


3 


33 


SEQ ID NO: 9318 


0.02 % 


I'. 1.5 


4 


3 


SEQ ID NO: 9319 


0.02 % 


1.25 


5 


9 


SEQ ID NO: 9320 


0.00 % 


. 0.5 


6 


18 


SEQ ID NO: 9321 


0.00 % 


. 0.5 


1 7 


20 


I SEQ ID NO: 9322 


0.00 % 


0.5 



5 



HLA A3 -9 mers 


Maximum possible score using this molecule type 


12150 




1 Start position 


Sequence 


% of max. score 


Score 


1 1 


1 4 


SEQ ID NO: 9323 


0.14 % 


18 


1 2 


i i6 


SEQ ID NO: 9324 


0.11 % 


13.5 


3 


23 


SEQ ID NO: 9325 


0.06 % 


8.1 


4 


18 


SEQ ID NO: 9326 


0.03 % 


4.05 


5 


21 


SEQ ID NO: 9327 


0.01 % 


2.025 


6 


9 


SEQ ID NO: 9328 


0.01 % 


1.8 


1 7 


15 


SEQ ID NO: 9329 


0.01 % 


1.8 


8 


25 


SEQ ID NO: 9330 


0.01 % 


1.8 


9 


! 12 


I SEQ ID NO: 9331 


0.00 % 


I 0.9 
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10 


19 


cca m MO- o^^o 


n nn o/_ 


u.y 


11 


20 


SEO ID NO: 9333 


0.00 % 


0.9 


12 


2 


SEO ID NO: 9334 


0.00 % 


0.81 


13 


22 


SEO ID NO: 9335 


0.00 % 


0.81 


14 


10 


SEO ID NO: 9336 


0.00 % 


1 0.6 




HLA A3 - 10 mers 


Maximum possible score using this molecule type 


12150 


Rank 


Start position 


i Sequence 


vo of max. score | 


Score 


1 


20 


SEO ID NO: 9337 


0.16 % 


Z\j.Z.o 


2 


9 


SEO ID NO: 9338 


0.09 % | 


12 | 


3 


16 


SEQ ID NO: 9339 


! 0.07 % i 


9 I 


4 


18 


SEO ID NO: 9340 . 


0.07 % | 


9 I 


5 


22 


SEQ ID NO: 9341 


0.06 % 


8.1 | 


6 


4 


SEQ ID NO: 9342 


0.03 % 


4.05 I 


7 


15 


SEQ ID NO: 9343 


0.03 % 


4.05 I 


8 


12 


SEQ ID NO: 9344 


0.02 % 


3.6 | 


9 


3 


SEQ ID NO: 9345 


0.00 % 


0.9 | 


10 


33 


SEQ ID NO: 9346 


0.00 % 


0.6 | 


11 


2 


SEQ ID NO: 9347 


0.00 % 


0.54 | 


12 


24 


SEQ ID NO: 9348 


0.00 % 


0.54 



HLA A24 - 9 mers 


Maximum possible score using this molecule type 


1596.672 




Start position 0 


Sequence 


% of max. score 


Score 


1 


8 i 


SEQ ID NO: 9349 | 


18.78 % 


300 


2 


11 I 


SEO ID NO: 9350 


1.87 % 


30 


3 


28 1 


SEQ ID NO: 9351 


1.50 % 


24 




7 ! 


SEO ID NO: 9352 I 


0.75 % 


12 


! 


I 17 II 


SEQ ID NO: 9353 | 


0.56 % 


9 




14 II SEQ ID NO: 9354 


0.46 % 


7.5 


7 


23 II SEQ ID NO: 9355 


0.37 % 


6 


8 I 


13 I 


SEO ID NO: 9356 


[ 0.36 % 


5.76 


9 I 


2 1 


SEQ ID NO: 9357 


0.35 % 


5.6 


10 


16 1 


SEO ID NO: 9358 


0.35 % 


5.6 


11 


9 1 


SEQ ID NO: 9359 


0.30 % 


4.8 


1 12 


21 ! 


SEO ID NO: 9360 


0.26 % 


4.2 


1 13 


5 1 


SEQ ID NO: 9361 


0.25 % 


4 


1 14 


1 4 1 


SEQ ID NO: 9362 


0.22 % 


3.6 
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J 15 I 0 


^FH TD NO* Q^6^ 


0 18 % 


3 


116 


19 




0 18% 


3 


17 


10 


SEO ID NO: 9365 


0.15 % 


2.4 


18 


18 


SEO ID NO: 9366 


0.13 % 


2.1 


19 


25 


SEO ID NO: 9367 


0.06 % 


1.1 


20 


15 


SEO ID NO: 9368 


0.05 % 


0.9 




HLA A24 - 10 mers 


Maximum possible score using this molecule type I 1596.672 


Rank 


Start position 


Sequence 


% of max. score I 


Score 


1 


8 


SEQ ID NO: 9369 


22.54 % 


360 


2 


7 1 


I SEQ ID NO: 9370 | 


1.25 % 


20 


3 


17 


1 SEQ ID NO: 9371 I 


0.65 % 


10.5 


4 


15 


1 SEQ ID NO: 9372 II 0.52 % 


8.4 


5 


4 1 


1 SEQ ID NO: 9373 II 0.45 % 


7.2 


6 


22 | 


1 SEQ ID NO: 9374 1 0.37 % 


6 


7 


12 1 


I SEQ ID NO: 9375 II 0.36 % 


5.76 


8 


27 1 SEQ ID NO: 9376 | 0.30 % 


4.8 


9 


14 B SEQ ID NO: 9377 | 


I 0.28 % 1 


4.5 


10 


20 1 SEQ ID NO: 9378 | 


I 0.26 % 


4.2 


11 


10 II SEQ ID NO: 9379 I 


! 0.25 % 


4 


12 


3 


! SEQ ID NO: 9380 | 


1 0.18% 1 3 


13 


18 


1 SEQ ID NO: 9381 ! 


I 0.18 % | . 3 


14 


9 


I SEQ ID NO: 9382 I 


! 0.15% 1 2.4 


15 


24 


1 SEQ ID NO: 9383 ! 


i 0.10% 1 1.65 


16 


16 


1 SEQ ID NO: 9384 ] 


I 0.07% 1 1.2 


17 


13 


1 SEQ ID NO: 9385 I 


1 0.06% 1 1 


18 




] SEQ ID NO: 9386 


j 0.05% 1 0.9 


19 


MM 


I SEQ ID NO: 9387 


1 0.05% I! 0.84 ! 



HLA A 0201 - 9 mers 


Maximum possible score using this molecule type 


3925227.1 


Rank I 


Start position 


Sequence 


% of max. score 


Score 


1 1 


12 


SEQ ID NO: 9388 


0.10 % 


4267.988928 


2 ! 


23 


SEQ ID NO: 93891 


0.03 % 


1360.69088544 




9 


SEQ ID NO: 9390 1 


0.01 % 


569.948832 


hH 


16 


SEQ ID NO: 9391 1 


0.00 % 


309.0498408 


5 I 


15 


SEQ ID NO: 9392 1 


0.00% U 79.73570448 


6 I 


2 


SEQ ID NO: 9393 || 0.00 % II 51.109542 
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7 II 18 


I SEQ ID NO: 9394 1 0.00 % 


45.25539984 


8 II 25 


SEQ ID NO: 9395 i 0.00 % 


34.28765802 


9 fl 22 


c tp\ m/"\ ■ none 

SEQ ID NO: 9396 


0.00 % Hi 26.532116325 


10 II 5 


SEQ ID NO: 9397 


0.00 % II 25.26691266 


11 II 21 


SEQ ID NO: 9398 


0.00 % || 4.72873208445 


12 


II 11 


SEQ ID NO: 9399 


0.00% II 2.638538265 


13 II 8 


SEQ ID NO: 9400 


0.00% II 2.4274552038 


14 | 4 


SEQ ID NO: 9401 


1 0.00% | 1.7415324 


15 H 20 


SEQ ID NO: 9402 


0.00 % | 1.6025526 


16 U 13 


SEQ ID NO: 9403 


0.00% 1 1.453803297 


17 


S 35 


SEQ ID NO: 9404 


0.00 % H 1.36878336 


18 


1 3 


SEQ ID NO: 9405 


0.00 % || 0.824619 


19 


33 


SEQ ID NO: 9406 


0.00 % j| 0.513774 




HLA A 0201 - 10 mers 


Maximum possible score using this molecule type || 3925227.1 


Rank 


Start position 


Sequence 


% of max. score H Score 


1 


! 22 


SEQ ID NO: 94071 


0.09% B 3636.068421648 


2 


4 


SEQ ID NO: 94081 


0.02 % I 1107.960876 


3 


15 


SEQ ID NO: 94091 


0.02 % II 836.2525104 


4 


16 


SEQ ID NO: 9410 II 0.00% II 150.9313176 


5 


12 


SEQ ID NO: 941111 0.00% || 76.55002416 


6 


1 


SEQ ID NO: 941211 0.00% | 49.0273014 


7 


10 


SEQ ID NO: 9413B 0.00 % H 42.1638414747 


8 


20 


SEQ ID NO: 9414 1 0.00% || 9.29480508 


9 


24 


SEQ ID NO: 9415 1 0.00% I 9.2669346 


10 


13 


SEQ ID NO: 941611 


0.00 % H 7.96581954 


11 


21 


SEQ ID NO: 9417 1| 


0.00 % 


5.051306761875 


12 


5 


SEQ ID NO: 941811 


0.00 % 


2.6941464 


13 


11 1 


SEQ ID NO: 9419 II 


0.00 % 


2.3839914 


14 


34 


SEQ ID NO: 942011 0.00 % 


1.465422 


15 


2 


SEQ ID NO: 9421 II 0.00 % 


0.70794 


16 


9 


SEQ ID NO: 9422 U 0.00% 


0.6513048 


17 


19 


SEQ ID NO: 94231 0.00% 


0.51882640425 



HLA A 1101-9 mers 


Maximum possible score using this molecule type 


36 


Rank || Start position || Sequence || % of max. score 


Score - ) 
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1 HLA A 1101 - 10 mere 


Maximum possible score using this molecule type 


36 


Rank 


Start position || Sequence | % of max. score 


Score 


1 


L 33 || SEQ ID NO: 9424 || 1.66 % 


0.6 




HLA B7 - 9 mere 


Maximum possible score using this molecule type 


5400 


Rank 


Start position fl Sequence J % of max. score 


Score 


1 


13 H SEQ ID NO: 9425 II 0.22 % 


12 


2 


f 2 R SEQ ID NO: 9426 II 0.07 % 


4 


3 


9 || SEQ ID NO: 9427 


1 0.07 % 


4 


4 


16 H SEQ ID NO: 9428 


1 0.07 % 


4 


5 


23 


I SEQ ID NO: 9429 


1 0.07 % 


4 


6 


5 | 


1 SEQ ID NO: 9430 


1 0.02 % 


1.2 


7 


15 H SEQ ID NO: 9431 


I 0.01 % 


1 




HLA B7 - 10 mere 


Maximum possible score using this molecule type . 11 5400 1 


Rank 


Start position 


Sequence 


% of max. score | Score I 


1 


4 


SEQ ID NO: 9432 


0.07 o/o B 4 1 


2 


10 


SEQ ID NO: 9433 


0.07% || 4 1 


3 


12 


SEQ ID NO: 9434 


0.07% || 4 1 


4 


15 


SEQ ID NO: 9435 


0.07 % II 4 ! 


5 


22 


SEQ ID NO: 9436 


1 0.07% | 4 | 


6 


13 


SEQ ID NO: 9437 


0.02% || 1.2 



5 Table 24: Epitopes for SEQ ID NO: 6050 



HLA Al - 9 mere 


Maximum possible score using this molecule type 


5625 I 




1 Start position 


Sequence 


% of max. score 


Score I 


1 


47 


SEQ ID NO: 9438 


0.01 % 


0.75 I 


2 


21 


SEQ ID NO: 9439 


0.00 % 


0.5 1 


3 


53 


SEQ ID NO: 9440 


0.00 % 


0.5 | 



HLA Al - 10 mere 


Maximum possible score using this molecule type 


5625 




Start position I Sequence II % of max. score 


Score 
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1 

X 


16 fl SEO ID NO: 9441 


V^VH /O 


1 ? q 


c. 


71 


SEO ID NO: 9442 




Z.D 


3 


47 


SEO ID NO: 9443 


0.02 % 


1.5 


4 


62 


SEO ID NO: 9444 


0.01 % 


0.9 


5 


20 


SEO ID NO: 9445 


0.00 % 


0.5 


6 


38 


SEO ID NO: 9446 


0.00 % 


0.5 




HLA A3 - 9 mere 


Maximum possible score using this molecule type 


12150 


Rank 


Start position 


Sequence || % of max. score 


Score 


1 


54 1 


SEQ ID NO: 9447 \ 0.02 % 


2.7 


2 


17 | 


1 SEQ ID NO: 9448 


1 0.01 % 


2 


3 


3 I 


! SEQ ID NO: 9449 


! o.oi % 


! 1.8 




f HLA A3 - 10 mers 


Maximum possible score using this molecule type 


12150 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


22 


SEQ ID NO: 9450 


0.09 % 


12 


2 


16 


SEQ ID NO: 9451 


0.01 % 


2 


3 


54 


SEQ ID NO: 9452 


0.00 % 


0.9 



HLAA24 - 9 mere 



Maximum possible score using this molecule ty 


pe 


1596.672 


Rank || Start position 


Sequence 


% of max. score 


Score 


1 II 70 


SEQ ID NO 


: 9453 


2.10 % 


33.6 


2 || 7 


SEQ ID NO 


: 9454 


1.12 % 


18 


3 || 60 


SEQ ID NO: 


: 9455 


0.46 % 


7.5 


1 4 || 54 


SEQ ID NO: 


: 9456 


0.37 % 


6 


1 5 | 14 . 


SEQ ID NO 


: 9457 


0.31 % 


5 


6 II 19 


SEQ ID NO. 


: 9458 


0.30 % 


4.8 


7 || 47 


SEQ ID NO: 


9459 


0.30 % ! 


4.8 


8 II 12 


SEQ ID NO: 


9460 


0.25 % 


4 


9 | 15 


SEQ ID NO: 


9461 


0.25 % | 


4 


10 II 67 


SEQ ID NO: 


9462 


0.25 % ! 


4 


11 II 21 


SEQ ID NO: 


9463 


0.18 % 


3 


12 || • 37 


SEQ ID NO: 


9464 


0.06 % i 


1 


13 || 27 


SEQ ID NO: 


9465 


0.03 % 


0.5 



I HLA A24 - 10 mere 
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iMaximum possible score using this molecule type || 1596.672 


1 Rank 


Start position 


Sequence I % of max. score | Score 


1 1 


14 


SEO ID NO: 9466 I 12.52 % 200 


1 2 


7 I SEO ID NO: 9467 H 0.93 % . 15 


I -> 
1 3 


11 1 SEO ID NO: 9468 II 0.75 % 12 


4 


60 H SEO ID NO: 9469 \ 0.56 % 9 


5 


18 11 SEO ID NO: 9470 \ 0.45 % II 7.2 


6 


46 II SEQ ID NO: 9471 || 0.45 % 


\ 7.2 


7 


53 H SEO ID NO: 9472 || 0.37 % 


6 ! 


8 


69 


! SEQ ID NO: 9473 \ 0.35 % 


5.6 


y 


66 


SEQ ID NO: 9474 1 0.25 % 


4 


10 


20 


SEQ ID NO: 9475 II 0.12 % 


2 


11 


47 


SEO ID NO: 9476 


0.07 % 1 


1.2 


12 


36 


SEQ ID NO: 9477 


0.06 % ! 


1 


13 


26 


SEQ ID NO: 9478 


0.04 % | 


0.75 


1 14 


70 


SEQ ID NO: 9479 


0.04 % ' f 


0.72 ! 



HLA A 0201 - 9 mers 


Maximum possible score 


using this molecule type 


1 3925227.1 


Rank 


Start position 


Sequence 


% of max. score 


Score . 


1 


54 I 


SEQ ID NO: 9480 


1 0.02 % 


881.199 


2 


26 


SEQ ID NO: 9481 


\ 0.00 % 


95.013 


3 


61 1 


SEQ ID NO: 9482 1 


0.00 % 


93.69648 


4 


19 | 


SEQ ID NO: 9483 1 


1 0.00 % 


40.2894864 


5 


74 


SEQ ID NO: 9484 


0.00 % 


12.6684 


6 


35 I 


SEQ ID NO: 9485 


0.00 % 


10.34586 


7 


« 1 


SEQ ID NO: 9486 


0.00 % 


3.3704706 


8 


13 | 


SEQ ID NO: 9487 


0.00 % 


1.656 


9 


15 


SEQ ID NO: 9488 


0.00 % 


1.47537042 


10 


68 


SEQ ID NO: 9489 


O'.OO % 


0.966 


11 


22 | 


SEQ ID NO: 9490 


0.00 % 


0.942678 


12 


12 1 


SEQ ID NO: 9491 


0.00 % 


0.7669695 


13 || 36 


SEQ ID NO: 9492 


0.00 % 


0.52661835 



HLA A 0201 - 10 mers 


Maximum possible score using this molecule type 


3925227.1 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


61 


SEQ ID NO: 9493 


0.00 % 


93.69648 


2 


25 


SEQ ID NO: 9494 


0.00 % 


63.33035625 
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3 


34 


1 SEQ ID NO: 9495 I 


0.00 % 


H 50 232 


4 


53 || SEQ ID NO: 9496 B 


0.00 % 


0 45.2838375 


5 


26 




0.00 % 


II 14 35752 


6 


27 


1 SEQ ID NO: 9498 II 


0.00 % 


B 2 8557858 


7 


17 1 


1 SEQ ID NO: 9499 1 


0.00 % 


tt 2 1Q7^2?9 


8 


36 1 


1 SEQ ID NO: 9500 II 


0.00 % 


II 1.798209 




69 1 


1 SEQ ID NO: 9501 II 


0.00 % 


U 1.03521597 


s 


67 1 


SEQ ID NO: 9502 I 


0.00 % 


H 0.966 




68 1 


SEQ ID NO: 9503 II 


0.00 % 


II 0.910938 


I 12 


11 


SEQ ID NO: 9504 1 


0.00 % 


I 0.7669695 



HLA A 1101 - 9 mers 


Maximum possible score usinq this molecule type 


36 


RankH Start position || Sequence I % of max. score 


Score | 


1 8 17 U SEQ ID NO: 9505 II 7.?? o /n 


0.8 1 




HLA A 1101 - 10 mers 1 


Maximum possible score usinq this molecule type 


36 i 


Rank I Start position 1 


Sequence U % of max. score 


Score | 


1 II 16 1 


SEQ ID NO: 9506 II 5.55 % 


2 . | 









HLA B7 


- 9 mers 






Maximum possible score 


usinq this molecule tvDe 




5400 1 


Rank 


1 Start position 


I 


Sequence 


1 


% of max. score 


Score 


1 


1 27 


II 


SEQ ID NO 


: 9507 


II 


0.37 % 


20 


2 


I 54 


II 


SEQ ID NO 


: 9508 


II 


0.22 % 


12 


3 


1— s— 


II 


SEQ ID NO 


: 9509 


1 


0.22 % 


12 


4 




I 


SEQ ID NO 


: 9510 


1 


0.11 % 


6 


5 


1 12 


1 


SEQ ID NO 


: 9511 


1 


0.07 % 


4 


6 I 


1 U 


II 


SEQ ID NO 


: 9512 


1 


0.07 % 1 


1 4 


7 I 


1 19 




SEQ ID NO 


: 9513 


1 


0.07 % | 


1 4 


8 I 


1 49 


II 


SEQ ID NO 


: 9514 


0.03 % II 2 


1 9 1 


! 69 


II 


SEQ ID NO: 


: 9515 




0.03 % 1 


1.8 


1 io | 


47 


II 


SEQ ID NO: 


9516 




0.02 % 


1.2 


1 11 1 


5 


II 


SEQ ID NO: 


9517 




0.01 % 




1 12 | 


9 


i 


SEQ ID NO: 


9518 




0.01 % II 




1 13 | 


35 


i 


SEQ ID NO: 


9519 




0.01 % II 


1 | 


1 14 | 


37 


II 


SEQ ID NO: 


9520 


! 0.01 % 


0.6 1 


1 15 I 


68 


1 


SEQ ID NO: 


9521 


0.01 % 


0.6 I 



-467- 



WO 2004/092360 



PCT/US2004/011710 



HLA B7 - 10 mere 


Maximum possible score usinq this molecule type 




fl DHUU 


Rank | Start position 


H Sequence 


1 


°/o Of max crnra 


II Qrnro 


1 


| 


69 


II SEQ ID NO: 9522 


II 


u.uu /o 


H jo 


2 




53 


II SEQ ID NO: 9523 


1 


O 99 oa 
U.Z<1 /o 


fl 12 


3 


I 


5 


II SEQ ID NO: 9524 


I 


U.J.O vo 


II 7 c 
II 7.5 


4 


I 


66 


II SEQ ID NO: 9525 II 


n 1 1 oa 

U . 1 J. /o 


fl 6 


5 


1 


11 


II SEO ID NO: 9526 


1 


U.U/ VO 


II A 

II 4 


6 


i 


27 


1 SEQ ID NO: 9527 


1 


U.U/ vo 


II a. 
II 4 


7 




46 


II SEO ID NO: 9528 




n n*7 o/ 
U.U/ vo 


B 4 


8 


I 


18 


II SEQ ID NO: 9529 




n 09 oa 

U.UZ VO 


II 1-5 

II 1.2 


9 




9 


II SEO ID NO: 9530 1 




n ni oa 

U.UX VO 


II 1 
II 1 


10 


i 


26 


II SEQ ID NO: 9531 1 




n ni o/. 
U.U1 vo 


I 1 


11 




25 


H SEQ ID NO: 9532 1 




0.01 % 


H 0.75 


12 


i 


17 


II SEQ ID NO: 9533 1 




0.01 % 


1. 0.6 


13 


i 


36 


H SEQ ID NO: 9534 




0.01 % 


11 0.6 


14 




68 


II SEQ ID NO: 9535 II 


0.01 % 


1 0.6 


15 


» 


35 


II SEQ ID NO: 9536 II- 


0.00 % 


S 0.5 


16 


« 


42 


II SEQ ID NO: 9537 II 




0.00 % 


II 0.5 


17 




73 


II SEQ ID NO: 9538 II 




0.00 % 


II 0.5 



Table 25: Epitopes for SEQ ID NO: 6052 



1 HLA Al - 9 mere 


IMaxImi 


im possible score 


using this molecule tvDe 




1 5625 


tana 


! Start position 


il Sequence | 


% of max. score 1 


1 Score 1 


i 


I 365 


ll SEO ID NO: 9539 II 


0.8 % 


I 45 


2 


1 397 


H SEQ ID NO: 9540 II 


0.44 % | 


1 25 


3 


I 229 


H SEQ ID NO: 9541 II 


0.32 % 


1 18 


Fh 


103 


8 SEQ ID NO: 9542 || 


0.17 % 


10 




338 


i SEQ ID NO: 9543 II 


0.17 % 


10 


6 1 


251 


II SEQ ID NO: 9544 II 


0.16 % | 


9 


7 | 


79 


II SEQ ID NO: 9545 II 


0.11 % 


6.25 


8 | 


119 


II SEQ ID NO: 9546 B 


0.10 % 


6 


9 


r 361 


II SEQ ID NO: 9547 II 


0.08 % 


5 


10 


60 


B SEQ ID NO: 9548 II 


0.04 % n 


2.25 


11 


101 


II SEQ ID NO: 9549 II 


0.04 % J 


2.25 1 


12 | 


278 


\ SEQ ID NO: 9550 II 


0.04 % II 


2.25 I 
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1 1 


71 H *?FO TD NH' QR^I 


n c\o ox* 1 1 ok 

U.UZ TO H l.ZD 


14 


164 II SEO TD NO* 9552 


n (\0 %. H i nc 


1 


1fi5 II <?FO TH NO- Q55"3 


n no oa [I 1 ok 


16 


295 II SEO ID NO' 9554 


n n? % II 1 ok 

U»UZ /0 || litlj 


17 


172 8 SEO ID NO: 9555 


0.01 % || 0.9 


18 


0 | SEQ ID NO: 9556 


0.01 % || 0.75 


19 


311 | SEO ID NO: 9557 


0.01 % || 0.75 


20 


78 || SEQ ID NO: 9558 


0.01 % j 0.625 




HLA Al - 10 mere 


Maximum possible score using this molecule type H 5625 


Rank 


I Start position 


Sequence 


% of max. score || Score 


1 I 


1 114 


SEQ ID NO: 9559 


1.11% j 62.5 


2 I 


1 134 


SEQ ID NO: 9560 


0.8% H 45 1 


3 II 365 


SEQ ID NO: 9561 


0.8% | 45 | 


4 . || 77 


SEQ ID NO: 9562 


0.66% 8 37.5 1 


5 j 103 


SEQ ID NO: 9563 


0.44 % 1 25 i 


6 || 23 - 


SEQ ID NO: 9564 


0.22 % B 12.5 1 


7 ' I . 338 I 


SEQ ID NO: 9565 


0.17%. i 10 1 


8 II 361 


SEQ ID NO: 9566 


0.17% 1 10 


9 II 324 


SEQ ID NO: 9567 


0.11 % : || 6.25 


10 || 375 


SEQ ID NO: 9568 


0.11 % j 6.25 I 


11 II 79 


SEQ ID NO: 9569 


0.04 % 8 2.5 1 


12 j 295 


SEQ ID NO: 9570 


0.04 % ' H 2.5 I 


13 || 346 


SEQ ID NO: 9571 


0.04% 8 2.5 I 


14 || 378 


SEQ ID NO: 9572 


0.03% I! 2 | 


15 1 251 ! 


SEQ ID NO: 9573 


0.03% 8 1.8 | 


16 j 214 | 


SEQ ID NO: 9574 


0.02 % 1 1.125 1 


1 17 || 160 


SEQ ID NO: 9575 


0.01% | 1 | 


18 II 172 | 


SEQ ID NO: 9576 


0.01% B 0.9 1 


19 II 229 I 


SEQ ID NO: 9577 


0.01 % || 0.9 1 


20 II 376 ! 


SEQ ID NO: 9578 


0.01% || 0.9 1 




HLA A3 - 9 mere 


Maximum possible score using this molecule type 


I 12150 


Rank 


Start position 


Sequence 


% of max. score 


| Score 


1 


229 


SEQ ID NO: 9579 


0.49 % 


1 60 


2 


361 


SEQ ID NO: 9580 


0.27 % 


I 33.75 


3 


330 


SEQ ID NO: 9581 


0.16 % 


1 20 
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A 
*T 




SEO ID NO: 9582 | 0.09 % 


1? I 


c 

D 


JJU 


SEQ ID NO: 9583 H 0.04 % 


D 1 


D 




SEO ID NO: 9584 I 0.04 % 


6 1 
O 1 


7 




SEQ ID NO: 9585 || 0.04 % 


54 1 


Q 
O 


9Q1 


SEO ID NO: 9586 1 


1 0.01 % 


O 1 
Z j 


Q 


941 
Z41 


SEO ID NO: 9587 


! 0.01 % 


1 ft I 
l.o | 


1 n 


9QH 


SEO ID NO: 9588 I 


L 0.01 % 


1 ft I 

l.o I 


1 1 

XJL 


JlD 


SEQ ID NO: 9589 | 


1 0.01 % 


1 ft I 


1 9 


ZZZ 


SEQ ID NO: 9590 I 


1 0.01 % 


1 nr I 

1.35 1 


1 ^ 


ZOO 


SEQ ID NO: 9591 | 0.01 % 


1.3b \ 






SEQ ID NO: 9592 I 0.00 % 


1 | 


ID 




SEQ ID NO: 9593 U 0.00 % 


0.9 | 


J.O 


X Jo | 


SEQ ID NO: 9594 8 0.00 % 


0.9 I 


17 


240 


SEQ ID NO: 9595 || 0.00 % 


0.9 | 


18 


119 ! 


SEQ ID NO: 9596 || 0.00% 1 0.675 I 


19 


44 | 


SEQ ID NO: 9597 | 0.00 % \ 0.6 | 


20 


161 | 


SEQ ID NO: 9598 || 0.00 % | 0.6 I 




HLA A3 - 10 mers j 


Maximum possible score using this molecule type 


12150 I 


Rank 


Start position 


Sequence 


% of max. score 


Score ( 


1 


338 


SEQ ID NO: 9599 


0.49 % 


60 | 


! 2 


160 


SEQ ID NO: 9600 


0.32 % 


40 i 


3 


352 


L SEQ ID NO: 9601 


0.24 % 


30 | 


4 


361 


SEQ ID NO: 9602 


0.18 % 


22.5 | 


5 


103 


SEQ ID NO: 9603 


0.13 % 


. 16.2 | 


6 


290 


SEQ ID NO: 9604 


0.07 % 


' 9 I 


7 


351 


SEQ ID NO: 9605 


0.07 % 


9 1 


8 


44 


SEQ ID NO: 9606 


0.04 % 


6 ! 


9 


228 


SEQ ID NO: 9607 


0.03 % 


4.05 i 


10 


394 i 


SEQ ID NO: 9608 


0.02 % 


3 I 


11 


240 


SEQ ID NO: 9609 


0.02 % 


2.7 I 


12 


100 I 


SEQ ID NO: 9610 


0.01 % 


1.8 ! 


13 


114 U SEQ ID NO: 9611 


0.01 % 


1.8 1 


14 


93 


SEQ ID NO: 9612 


0.01 % 


1.5 1 


15 


134 


SEQ ID NO: 9613 


0.01 % 


1.5 | 


16 


221 


SEQ ID NO: 9614 


0.01 % 


1.35 | 


17 


330 


SEQ ID NO: 9615 


0.00 % 


1.2 | 


18 


112 


SEQ ID NO: 9616 


0.00 % 


0.9 1 
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19 fl 


218 


II SEOIDNO:9617 1 


0.00 % 




1 0.9 j 


1 20 I 


55 


II SE0IDNO:9618 » 


0.00 % 




n o.6 




HLA A24 - 9 mere 


Maximum possible score usinq this molecule type 


1 1596.672 


Rank U Start position H Sequence | o /o of msnr _ 


1 Score 


-H- 


ATT 

345 


H SEQ ID NO: 9619 II 


1.50 % 




1 24 


2 H 


306 


II SEQ ID NO: 9620 II 


0.75 % 




1 12 


II 


222 


II SEO ID NO: 9621 II 


0.54 % 




1 8.64 


4 II 


HI H 
111 


II SEQ ID NO: 9622 II 


0.51 % 




1 8.25 


q II 

3 II 


159 


li SEQ ID NO: 9623 H 


0.45 % 




1 7.2 


6 0 


219 


II SEQ ID NO: 9624 II 


0.45 % 




1 7.2 


7 H 
/ II 


283 


II SEO ID NO: 9625 II 


0.45 % 


J 


1 7.2 


ft II 
o || 


ZOO 


II SEO ID NO: 9626 H 


0.42 % 


1 


1 6.72 


Q n 
y ii 


cc 

JO 


li SEQ ID NO: 9627 II 


0.41 % 




! 6.6 


m H 

1U || 


131 


II SEQ ID NO: 9628 II 


0.37 % 




i ■'• 6 


U 1 


214 


II SEO ID NO: 9629 II 


0.37 % 


1 


.6 


12 1 


297 


II SEQ ID NO: 9630 II 


0.37 % 


1 


6 


13 II 


86 


H SEO ID NO: 9631 I 


0.31 % 


1 


5 


14 1 


122 


II SEQ ID NO: 9632 II 


0.31 % 




5 


15 1 


48 


II SEQ ID NO: 9633 II 


0.30 % 


H 


4.8 


16 | 


105 


II SEQ ID NO: 9634 II 


0.30 % 


II 


4.8 


17 1 


213 


II SEO ID NO: 9635 II 


0.30 % 


1 


4.8 


18 1 


323 


II SEQ ID NO: 9636 II 


0.30 % 


1 


•4.8 


19 I 


338 


II SEQ ID NO: 9637 II 


0.30 % 


1 


4.8 1 


20 | 


399 


II SEQ ID NO: 9638 H 


0.30 % 


li 


4.8 1 








HLA A24 - 10 mere 








Maximum possible score usinq this molecule type 




I 


1596.672 


Rank | Start position H Sequence B o /o of max . scope 




Score 




65 


H SEQ ID NO: 9639 H 


0.93 % 


f 


15 ! 




306 


II SEQ ID NO: 9640 II 


0.75 % 


I 


12 


-H- 


95 


II SEQ ID NO: 9641 II 


0.66 % 


I 


10.56 




36 


II SEQ ID NO: 9642 1 


0.60 % 


1 


9.6 


+h 


385 


II SEQ ID NO: 9643 II 


0.50 % 


II 


8 




111 


U SEQ ID NO: 9644 H 


0.46 % 


I 


7.5 


7 1 


104 


II SEQ ID NO: 9645 II 


0.45 % 




7.2 


8 1 


214 


H SEQ ID NO: 9646 I 


0.45 % 




7.2 


9 1 


221 


II SEQ ID NO: 9647 1 


0.45 % 




7.2 
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10 1 


1 277 


SEQ ID NO: 9648 


0.45 % 


7.2 I 


11 


150 


SEQ ID NO: 9649 


0.37 % 


6 


12 


152 


SEQ ID NO: 9650 


0.37 % 


6 


13 


158 


SEQ ID NO: 9651 


0.37 % 


6 


14 


171 


SEQ ID NO: 9652 


0.37 % 


6 


15 


1 343 


SEQ ID NO: 9653 


0.37 % 


6 


1 16 


110 


SEQ ID NO: 9654 


0.34 % 


5.5 


1 17 


85 


SEQ ID NO: 9655 


0.31 % 


5 


18 


47 


I SEQ ID NO: 9656 


0.30 % 


4.8 


19 


213 


SEQ ID NO: 9657 


0.30 % 


4.8 


20 


218 


I SEQ ID NO: 9658 


0.30 % 


4.8 



I HLA A 0201 - 9 mers 


Maximum possible score using this molecu 


e type 


3925227.1 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


222 


SEQ ID NO: 9659 


0.03 % 


1267.10434728 


2 


226 


^EO TD NO' QfifiO 

Jty XI-/ liV/i JvUu 


I 0.00 % 


69.552 




1 316 


SEQ ID NO: 9661 


0 00 % 




m 


I 351 


SEQ ID NO: 9662 


0.00 % 


31.24872 


5 


i 159 


SEQ ID NO: 9663 


0.00 % 


13.6235739 


6 


406 


SEQ ID NO: 9664 


0.00 % 


11.4264 


7 


165 


SEQ ID NO: 9665 


0.00 % 


8.14407 


8 I 


238 


SEQ ID NO: 9666 


0.00 % 


7.0518 


9 I 


138 


SEQ ID NO: 9667 


0.00 % 


5.112072 


10 I 


130 


SEQ ID NO: 9668 


0.00 % 


3.00547233 


11 1 


303 


SEQ ID NO: 9669 


0.00 % 


2.59578 


12 


157 


SEQ ID NO: 9670 


0.00 % 


2.412585 


13 


219 


SEQ ID NO: 9671 


0.00 % 


2.103255861 


I 14 1 305 


SEQ ID NO: 9672 


0.00 % 


1.86369 


15 || 158 


SEQ ID NO: 9673 


0.00 % 


1.646892 


16 


331 


SEQ ID NO: 9674 


0.00 % 


1.614048 


17 


399 


SEQ ID NO: 9675 


0.00 % 


1.442246832 


18 


324 


SEQ ID NO: 9676 


0.00 % 


1.319625 


19 


312 


SEQ ID NO: 9677 


0.00 % 


1.233099 


20 


262 


SEQ ID NO: 9678 


0.00 % 


0.966 



HLA A 0201 - 10 mers 


Maximum possible score using this molecule type 


3925227.1 


Rank || Start position II Sequence H % of max. score 


Score 
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1 


721 


SEO TD NO* 9679 


0 00 % 


"30Q 04Q840R 


£- 


r -i 1 2 

lid 


^FO TH NO* Q680 

JLy XL*/ livi JUUU 


0 00 % 


on 9fi704 

Z70.ZO/ U*T 


O 


i^o 


^fh m no- Qfisi 


0 00 ©A 

UiUU /0 


QQ 76704 


A 
*r 


1 Rfl i 

IJO I 


^FO TH NO* QfiR? 


0 00 ©A 

U.UU /o 


JO.JlUUO 


D 


91ft "1 
Z JLO [ 


OuV^ XL/ IMU. jUOJ 


0 00 OA 
UiUU /o 


OA (Y7^A0AR 


D 


1 OA I 
1ZH j 


OCV^ iU IMU. Z7QO*t 


O OO oa 
u.uu vo 


1^71 QQ 

iz.ziyy 


-7 
/ 


re I 
3D 


bty 1U IMU. yooj 


n nn oa. 
U.UU vo 


lU.^fD/D/D 


Q 


•31 c I 


CCO TO MO- OAQA 


n nn o/* 
U.UU vo 


/./Z/'I'ZU'f 


y 


icn I 
JdU [ 


CCO TO MO" OCQ7 


n nn o/ 
U.UU vo 


•^.zyooyy 


1 n 
IU 


Af\K I 
HUD j 


CPO TO MO" QCQQ 


n nn oz* 
U.UU vo 


'f.ZOD^O/ 


11 


OQQ j 
JOO J 


CeO TO MO- OAQQ 

oty iu imu. yooy 


n nn oz* 
U.UU vo 








ceo to mo- ocon 
iu imu. yoyu 


n nn oa 

U.UU /o 


j.oojoUj 


I 1^ 


1 *an I 
loU [ 


CCO TO MO" QCQ1 

oty iu imu. yoyi 


n nn oz. 
U.UU vo 


j.-HZoDyiyuj 


1 A 




CeO TO MO" QCOO 

id imu. yoyz 


n nn oa 
U.UU vo 


0.*rl lZJUuZD 


ID 


■ 1 I 


CeO TO MO- 

otv^iu nu. yoyj 


n nn oz. 
U.UU vo 


9 000/1*3 

z.yyy^io 


16 


410 1 


SEO ID NO* 9694 


0 00 % 1 


2 63718 


17 


. 316 t 


SEQ ID NO: 9695 


0.00 % 


2.48686074 


18 


104 


SEQ ID NO: 9696 


0.00 % 


2.477311485 


19 


• 164 | 


SEQ ID NO: 9697 


0.00 % 


2.2011 


20 


282 | 


SEQ ID NO: 9698 


1 0.00 % 


2.16591 



HLA A 1101 - 9 mere 


Maximum possible score using this molecule type . 


36 


Rank 


Start position 


Sequence 


% of max. score 


Score 


1 


361 


SEQ ID NO: 9699 


16.66 % 


6 


2 


53 


SEQ ID NO: 9700 


2.77 % 


1 


3 


240 


SEQ ID NO: 9701 


1.66 % 


0.6 


4 


241 


SEQ ID NO: 9702 


1.66 % 


0.6 



HLA A 1101 - 10 mere 



Maximum possible score using this molecule type 


J 


36 


Rank || Start position 


Sequence 


% of max. score | 


Score 


1 II 


361 


I SEQ ID NO: 9703 


16.66 % 


6 


2 II 


93 


I SEQ ID NO: 9704 


8.33 % ! 


3 


3 II 


338 


I SEQ ID NO: 9705 


! 3.33 % 


1.2 


4 II 


134 


! SEQ ID NO: 9706 


2.77 % | 


1 


5 II 


228 


1 SEQ ID NO: 9707 


2.5 % | 


0.9 


6 » 


160 


I SEQ ID NO: 9708 


2.22 % ! 


0.8 


i 7 ii 


239 


1 SEQ ID NO: 9709 


1.66 % 


0.6 
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8 


240 


SEO ID NO: 9710 


1.66% 1 0.6 I 


9 


257 


SEO ID NO: 9711 


1.66% 1 0.6 | 


10 


379 


SEO ID NO: 9712 


1.66% 1 0.6 1 




|" HLA B7 - 9 mers 


iMaximum possible score usinq this molecule type 


5400 


Rank Start position 


Sequence 1 % of max. score 


Score 


1 || 105 


SEQ ID NO: 9713 Q 14.81 % 


800 


2 II 66 


SEO ID NO: 9714 | 1.48 % 


80 


3 ! 


93 


SEQ ID NO: 9715 || 0.92% 


50 


4 1 257 


SEQ ID NO: 9716 H 0.55 % 


30 


5 I 


323 


SEQ ID NO: 9717 || 0.37 % 


. 20 


6 I 


211 


SEQ ID NO: 9718 II 0.22% 


12 


7 1 


219 I 


SEQ ID NO: 9719 || 0.22% , I 


12 


8 1 


, 403 


SEQ ID NO: 9720 H 0.18 % | 


10 


9 1 


343 II SEQIDNO:9721 H 0.14% i 


8 


10 || 12 II SEQ ID NO: 9722 | 0.11% 


L 6 


11 || 113 


SEQ ID NO: 9723 | 0.11% 


1 c 


12 || 48 


SEQ ID NO: 9724 II 0.07 % 


4 


13 || 56 


SEQ ID NO: 9725 M 0.07 % 


. 4 


14 j 150 


SEQ ID NO: 9726 || 0.07 % 


4 


15 || 153 


SEQ ID NO: 9727 


| 0.07 % 


4 


16 II 159 


SEQ ID NO: 9728 


I 0.07 % 


4 


17 || 213 


SEQ ID NO: 9729 


I 0.07 % ; 


4. 


1 18 U 216 


SEQ ID NO: 9730 H 0.07 % 


4 


I 19 || 222 


SEQ ID NO: 9731 j 0.07 % • 


I 4 


I 20 || 283 


SEQ ID NO: 9732 || 0.07 % • 


1 4 




[_ HLA B7 - 10 mers 


Maximum possible score using this molecule type 1 5400 1 


Rank 


Start position 


Sequence H % of max. score II Score 


1 


' 36 


SEQ ID NO: 9733 | 1.48 % B 80 


2 


150 


SEQ ID NO: 9734 || 1.48 % H 80 


3 


343 1 SEQ ID NO: 9735 | 1.48 % 1 80 


4 


12 II SEQ ID NO: 9736 || 1.11 % II 60 


5 


308 || SEQ ID NO: 9737 


1 1.11% 1 60 


6 


130 i SEQ ID NO: 9738 


j 0.37% II 20 


7 


55 II SEQ ID NO: 9739 


1 0.22% 1 12 


8 


210 II SEQ ID NO: 9740 


| 0.22% II 12 
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q 


218 


SEO ID NO: 


9741 


0.22 % 


12 




201 


SEO ID NO: 


9742 

«/ f it 


0 18 % 

VilU /if 


10 


1 1 

-L x 


121 


SEO ID NO: 


9743 

«/ # r 


0 14 % 


8 


1 2 


3Q1 


SEO ID NO* 


9744 


0 13 % 


7 5 


1 3 


112 

J. X£> 


SEO ID NO* 


9745 


0 11 % 

V ■ X X /U 


6 


14 


JOJ 


SEO ID NO' 


9746 


0 11 % 

V ■ X X /(J 


6 


1 ^ 

xO 


4.7 


TD NO* 


Q747 


0 07 % 


4 


16 


! 66 


SEQ ID NO: 


9748 


0.07 % 


4 


17 


95 


SEQ ID NO: 


9749 


0.07 % 


4 


18 


104 


I SEQ ID NO: 


9750 


0.07 % 


4 


19 


152 


SEQ ID NO: 


. 9751 


0.07 % 


4 


20 


! 158 


I SEQ ID NO 


: 9752 


0.07 % 


4 
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TABLE 26: Cloned sequences for Exoti expression 



ORF 


DNA length 
bp 


Cloning 


pET 


pGEX 


P28 


537 


Ndel/Xhol 




P65 


1917 


Nhel / HinriTTT 

JL lllwjl # JLJ I UU III 




NsplA 


2495 


NheT / Xhol 

XlliwX / AUvl 




NsplB 


2153 


Ndel / Xhol ! 




NsplC 


2612 


NdeT / Xhol 




Nsp2A 


431 


NdpT / XhnT 


"RamHT / XhoT 

mil i nil / zujui 


I Nsd2B 


426 \ 


NHpT / XhnT 


"RamTTT / XhoT 


Nsp3 


870 


NdpT / XhnT 

XI LLC A / j\U\JX. 




Nsn4 


249 | 


NHpT / XhnT 


"RaniTTT / XhoT 


NsdS 


594 


NhpT / XhnT 




Nsp6 


339 


NdeT / XhoT 

liUCl / ,/YAAl/A 


BamHI / Xhol 


Nsp7 i 


417 


NdeT / XhoT 


BamHI / XhoT 


Nsp9A 


1385 


NheT /Xhol 




Nsp9B 


1409 


NdeT / Xhol 




NsplO 


1803 


NheT / Xhol 

XlXiwX f X\H\J±. 




Nspll 


1581 


Ndel/Xhol 




Nspl2 


1038 


Ndel / FfinrfTTT 




Nspl3 


897 


Ndel/Xhol 




Spike (SI) 


1946 


Ndel /Xhol 




Spike (S2) 


1598 


Ndel /Xhol 




i Spike (S1-S2) 


3545 


Ndel /Xhol 




HR1 


287 


Ndel /Xhol 


BamHI /Xhol 


HR2 


146 


Ndel /Xhol 


BamHI /Xhol 


ORF3A100 


525 


Ndel /Xhol 




ORF4 


465 


Ndel /Xhol 




Envelope (E) . 


231 


Ndel /Xhol 


! BamHI /Xhol 


Matrix (M)A100 


366 


Ndel /Xhol 


BamHI /Xhol 


ORF7A18 


137 


Ndel /Xhol 


BamHI /Xhol 


ORF8 


i 369 


Ndel /Xhol 


BamHI /Xhol 


ORF9 


135 


| Ndel /Xhol 


BamHI /Xhol 


ORF10 


120 


Nhel /Xhol 


BamHI /Xhol 


ORF11 


1 255 


Ndel /Xhol 


BamHI /Xhol ! 


Nucleocapsid (N) 


1269 


Ndel/EcoRI 




ORF12 


297 


Ndel/EcoRI 


BamHI / EcoRI 
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TABLE 27: Primers 



ORF 


Forward primer 


Reverse primer 


P28 


9803 


9818 


P65 


9804 


9819 


NsplA 


9805 


9820 


NsplB 


9806 


9821 


NsplC 


9807 


9822 


I Nsp2 + Nsp3 


9808 


9823 


Nsp4toNsp7 


9809 


9824 


Nsp9A 


9810 


9825 


Nsp9B 


9811 


9826 


NsplO 


9812 


9827 


Nspll 


9813 


9828 


Nspl2-Nspl3 


9814 


9829 j 


ORF3-ORF4 


9815 


9830 


Env-ORFIO 


9816 


9831 


ORF11-ORF12 


9817 


9832 


LE 28: Primers ~ 


ORF 


Forward primer 


Reverse primer 


Nsp2A 


SEQIDNO:9833 


SEQ ID NO: 9858 


Nsp2B 


SEQ ID NO: 9834 


SEQ ID NO: 9859 


Nsp3 


SEOIDNO:9835 


SEQ ID NO: 9860 


Nsp4 


SEQIDNO:9836 


SEQ ID NO: 9861 


Nsp5 


SEQ ID NO: 9837 


SEQ ID NO: 9862 


Nsp6 


SEQIDNO:9838 


SEQ ID NO: 9863 


Nsp7 


SEQ ID NO: 9839 


SEQ ID NO: 9864 


Nspl2 


SEQ ID NO: 9840 


SEQ ID NO: 9865 


Nspl3 


SEQ ID NO: 9841 


SEQ ID NO: 9866 


Spike SI 


SEQ ID NO: 9842 


SEQ ID NO: 9867 


Spike S2 


SEQ ID NO: 9843 


SEQ ID NO: 9868 


Spike S1-S2 


SEQ ID NO: 9844 


SEQ ID NO: 9869 


HR1 


SEQ ID NO: 9845 


SEQ ID NO: 9870 


HR2 


SEQ ID NO: 9846 


SEQ ID NO: 9871 


Orf3A100 


SEQ ID NO 9847 


SEQ ID NO: 9872 


[ Orf4 


SEQ ID NO: 9848 


SEQ ID NO: 9873 


EnvE 


SEQ ID NO: 9849 


SEQ ID NO: 9874 


Matrix MA100 


SEQ ID NO: 9850 


SEQ ID NO: 9875 


Orf7A18 


SEQ ID NO: 9851 


SEQ ID NO: 9876 


Orf8 


SEQ ID NO: 9852 


SEQ ID NO: 9877 


! Or© 


SEQ ID NO: 9853 


SEQ ID NO: 9878 


OrflO 


SEQ ID NO: 9854 


SEQ ID NO: 9879 


Orfll 


SEQ ID NO: 9855 


SEQ ID NO: 9880 


Nucleocapsid N 


SEQ ID NO: 9856 


SEQ ID NO: 9881 


| Orfl2 


SEQ ID NO: 9857 


SEQ ID NO: 9882 
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TABLE 29: Cloning, purification and expression in Kcoli 



SARSCoVORFs 


M.W 
Kd 


cloning 


Expr. 


purification as 


P28 


19.7 


+ 


+ 


his sol 


P65 


70,3 


+ 


+ \ 


his sol 


1 iMspi Aj n-ie rnii _ 


Oi £ 


i 

T 


■ 


hie inc 




Oli.O 








nsptc (c-term) — 


90.0 


■r J, 






MenOA /Kl-torm\ 

|! iNsp^Mj^N-iernii 


1^ ft 
io,o 






hie infi 

1119 1119 




15 5 


+ 


+ 


his sol 


Nsp3 


31.9 


+ 






I Nsd4 


9,1 


+ ! 


+ ! 


his sol ! 


I Nsp5 


21,8 


+ 


+ 


his sol 


Nsd6 


12,4 I 


+ 


.+ __ 


his sol 


Nsp7 


15,3 


+ 


+ 


his ins 


Nsp9A(N-term) : 


50,8 


+ 


- 




i Nsd9B f C-term) 


51.6 


+ 




his ins 


Nsd10 


66 








Nsp11 


pi it 








Nsd12 


38 


- 






Nsp13 


32.7 


f 


+ 


his ins 


Spike (S1-his) 


. _ 71,3 




+ 


his ins 


Spike (S2-his) 


58,6 


+ 






I Spike (S1S2-his) 


130 


+ 


+ 


his ins 


! HR1 _ 


11 


+ 


+ 


his ins 


HR2 


5,4 


+ 


+ 


his sol 


ORF3 A100 1 


19.1 


+ 


- 




ORF4 


16.9 


+ 


+ 


1 his ins (trimer) 


Envelope (E) 


34.3 


+ 


+ 


ast ins (IB) 


Matrix (MVA1 00 


! 13.3 


+ 


+ 


his ins i 


ORF7A18 2 


31 


+ 


+ 


ast sol 


ORF8 


39,5 


+ 


+ 


ast ins (IB) 


ORF9 


30.8 


| + 


+ 


gst sol 


I ORF10_ 


30.3 


+ 


+ 


ast ins (IB) 


ORF11 _ 


j 35,2 


+ 


+ 


ast ins (IB) 


Nucleocapsid (N) 


43.6 


+ 


+ 


his ins 


ORF12 


36,7 


+ 


! + 


his ins \ 
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TABLE 30: B.coti expression, purity and yield 



xTuiein 


Ton 

lag 


Puritv (%) 


Yield (mgfl) 




X XlO 


95 


1.7 






95 


4.1 


Nsp4 


Xllb 


95 


12.6 


JNSpD 


nis 


95 


5.88 


Nsp6 


Jala 


95 


8.1 


P28 


His 


95 


1 


P65 


His 


80 


0.553 


HR2 


His 


95 


11.9 


HR1 


His 


80 


2.64 


NsplA 


His 


95 


! 0.267 


Spite S1-S2 


His 


I 80 


0.381 


Matrix M 


His 


85 


12.4 


ORF7 


GST 


85 


4.9 



TABLE 31: Primers 





Rank 

XX rt I.I IV 


Model 


Local 


(Position) 


XvFZf.?*/ 


Fl 


i 




(106) 


10236 


F2 


2 


i 


(728) 


10237 


F3 


3 




(112) 


10238 


F4 


5 




(1331) 


10239 


F5 


6 




(12) ! 


10240 


F6 


6 




(346) 


10241 


F7 


8 




(904) 


10242 


F8 


9 




(1016) 


10243 


F9 


9 




(1015) 


10244 


F10 


9 




(719) 


10245 


Fll 


9 




(720) 


10246 


F12 


10 




(724) 


10247 


Rl 


2 




(1283) 


10248 


R2 


4 




(756) 


10249 


R3 


4 




(758) 


10250 


R4 


5 




(259) 


10251 


R5 


6 




(54) 


10252 


R6 


7 




(648) 


10253 


R7 


8 




(948) 


10254 


i R8 


8 




(260) 


10255 


R9 


9 




(1282) 


10256 


R10 


9 




(950) 


10257 


Rll 


9 




(756) " 


10258 


R12 


10 




(132) 



5 
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TABLE 32: Primers 



Primers List: (forward) 
Scores 



Rank 


Hodel 


Local 


Sequence 


(Position) 


PI 


7. 


1 


SEQ ID NO: 


10352 


(290) 


F2 


7 


1 


SEQ ID NO: 


10353 


(291) 


P3 


7 


1 


SEQ ID NO: 


10354 


(294) 


F4 


7 


1 


SBQ ID NO: 


10355 


(292) 


P5 


7 


1 


SBQ ID NO: 


10356 


(293) 


P6 


9 


1 


SEQ ID NO: 


10357 


(198) 


F7 


9 


1 


SEQ ID NO: 


10358 


(199) 


F8 


10 


1 


SEQ ID NO: 


10359 


(33) 


P9 


11 


1 


SBQ ID NO: 


10360 


(200) 


P10 


11 


1 


SEQ ID NO: 


10361 


(299) 


Pll 


12 


1 


SEQ ID NO: 


10362 


(298) 


F12 


12 


1 


SEQ ID NO: 


10363 


(297) 


P13 


14 


1 


SEQ ID NO: 


10364 


(35) 


F14 


14 


1 


SEQ ID NO: 


10365 


(34) 


F15 


16 


1 


SEQ ID NO: 


10366 


(300) 


PI 6 


17 


1 


SEQ ID NO: 


10367 


(295) 


F17 


17 


1 


SEQ ID NO: 


10368 


(296) 


P18 


17 


1 


SEQ ID NO: 


10369 


(175) 


PI 9 


17 


1 


SEQ ID NO: 


10370 


(36) 


P20 


20 


1 


SEQ ID .NO: 


10371 


(202) 


F21 


20 


1 


SEQ ID NO: 


10372 


(201) 


P22 


28 


1 


SEQ ID NO: 


10373 


' (204) 


P23 


28 


1 


SEQ ID NO: 


10374 


(203) 


P24 


29 


1 


SEQ ID NO: 


10375 


(269) 


P25 


29 


1 


SEQ ID NO: 


10376 


(268) 


Primers List (re 


-verse) 








Rank 


Model 


Local 


Sequence 


(Position) 


Rl 


7 


1 


SEQ ID NO: 


10377 


(337) 


R2 


9 


1 


SEQ ID NO: 


10378 


(229) 


R3 


11 


1 


SEQ ID NO: 


10379 


(230) 


R4 


11 


1 


SEQ ID NO: 


10380 


(338) 


R5 


12 


1 


SEQ ID NO: 


10381 


(207) 


R6 


12 


1 


SEQ ID NO: 


10382 


. (338) 


R7 


13 


1 


SEQ ID NO: 


10383 


(231) 


R8 


14 


1 


SEQ ID *TC>: 


10384 


(80) 


R9 


14 


1 


SEQ ID NO: 


1038S 


(232) 


RIO 


15 


1 


SEQ ID NO: 


10386 


(82) 


Rll 


16 


1 


SEQ ID NO: 


10387 


(340) 


R12 


17 


1 


SEQ ID NO: 


10388 


(83) 


R13 


17 


1 


SEQ ID NO: 


10389 


(206) 


R14 


17 


1 


SEQ ID NO: 


10390 


(82) 


R15 


17 


1 


SEQ ID NO: 


10391 


(337) 


R16 


18 


1 


SEQ ID NO: 


10392 


(341) 


R17 


20 


1 


SEQ ID NO: 


10393 


(340) 


R18 


20 


1 


SEQ ID NO: 


10394 


(233) 


R19 


21 


1 


SEQ ID NO: 


10395 


(79) 


R20 


22 


1 


SEQ ID NO: 


10396 


(213) 


R21 


28 


1 


SEQ ID NO: 


10397 


(236) 


R22 


29 


1 


SEQ ID NO: 


10398 


(317) 


R23 


32 


1 


SEQ ID NO: 


10399 


(391) 


R24 


35 


1 


SEQ ID NO: 


10400 


(57) 


R25 


36 


1 


SEQ ID NO: 


10401 


(237) 



Primers List (left part); SEQ ID NO 8 : 10402-10433 



Primers list (right part): SEQ ID NO": 10434-10464 



Primers List (forward): SEQ ID NO 8 : 10465-10484 



Primers List (reverse): SEQ ID NO 8 : 10485-10504 
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Primers list (forward) 

Scores 



Ran ?c 




ijocaj. 




Sequence 


f Position \ 


Fl 


1 


1 




ID NO: 


i a con 


\ OJ ' / 


F2 


z 


1 


oEQ 


ID NO: 


10581 


\& $y i 


F3 


z 


1 




ID NO: 


10582 


(AAf\\ 


F4 


3 


1 


SEQ 


ID NO: 


10583 


IT>Q\ 


F5 


4 


1 


SEQ 


ID NO: 


10584 


\o9o) 


F6 


4 


1 


SEQ 


ID NO: 


10585 


/ CQ*7 \ 

ioy / ) 


F7 


4 


1 


SEQ 


ID NO: 


10586 


(111) 


F8 


5 


1 


SEQ 


ID NO: 


10587 


(867) 


F9 


5 


1 


SEQ 


ID NO: 


10588 


/ o c o \ 

(868) 


F10 


5 


1 


SEQ 


ID NO: 


10589 


(869) 


Fll 


5 


1 


SEQ 


ID NO: 


10590 


(640) 


F12 


6 


1 


SEQ 


ID NO: 


10591 


(438) 


F13 


6 


1 


SEQ 


ID NO: 


10592 


(437) 


F14 


6 


1 


SEQ 


ID NO: 


10593 


t A 1 C \ 

(436) 


F15 ■ 


6 


1 


SEQ 


ID NO: 


10594 


(732) 


F16 


6 


1 


SEQ 


ID NO: 


10595 


(635) 


F17 


6 


1 


SEQ 


ID NO: 


10596 


1 A CI \ 

(457) 


F18 


6 


1 


SEQ 


ID NO:* 


10597 


i a c a \ 

(458) 


F19 


6 


1 


SEQ 


ID NO: 


10598 


(636) 


F20 


7 


1 


SEQ 


ID NO: 


10599 


(854) 


F21 


7 


1 


SEQ 


ID NO: 


10600 


(855) 


F22 


7 


1 


SEQ 


ID NO: 


10601 


(581) 


F23 


7 


1 


SEQ 


ID NO: 


10602 


(853) 


F24 


7 


1 


SEQ 


ID NO: 


10603 


(342) 


F25 


' 7 


1 


SEQ 


ID NO: 


10604 


(343) 


F26 


7 


1 


SEQ 


ID NO: 


10605 


(112) 


F27 


7 


1 


SEQ 


ID NO: 


10606 


1 QA \ 

(94) 


F28 


7 


1 


SEQ 


ID NO: 


10607 


(642) 


F29 


8 


1 


SEQ 


ID NO: 


10608 


(638) 


F30 


8 


1 


SEQ 


ID NO: 


10609 


(639) 


F31 


8 


1 


SEQ 


ID NO: 


10610 


(730) 


F32 


8 


1 


SEQ 


ID NO: 


10611 


(641) 


F33 


8 


1 


SEQ 


ID NO: 


10612 


(731) 


F34 


8 


1 


SEQ 


ID NO: 


10613 


(326) 


F35 


8 


1 


SEQ 


ID NO: 


10614 


(325) 


F36 


9 


1 


SEQ 


ID. NO: 


10615 


IDl /) 


F37 


9 


1 


SEQ 


ID NO: 


10616 


( /oi) 


F38 


9 


1 


SEQ 


ID NO: 


10617 


/ on q \ 
\Z\JO) 


r J5> 


Q 


X 


oisy 


ID NO: 


10618 


t 9HQ \ 

\ } 


F40 


9 


1 


SEQ 


ID NO: 


10619 


(702) 


F41 


9 


1 


SEQ 


ID NO: 


10620 


(210) 


F42 


10 


1 


SEQ 


ID NO: 


10621 


(634) 


F43 


10 


1 


SEQ 


ID NO: 


10622 


(694) 


F44 


10 


1 


SEQ 


ID NO: 


10623 


(693) 


F45 


10 


1 


SEQ 


ID NO: 


10624 


(728) 


F46 


10 


1 


SEQ 


ID NO: 


10625 


(695) 


F47 


10 


1 


SEQ 


ID NO: 


10626 


(95) 


F48 


11 


1 


SEQ 


ID NO: 


10627 


(455) 


F49 


11 


1 


SEQ 


ID NO: 


10628 


(456) 


F50 


11 


1 


SEQ 


ID NO: 


10629 


(454) 
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Primers last (reverse) 



Scores 



Rank 


Model 


local 




Sequent 


SO 


(Position) 


Rl 


1 


1 


SEQ 


ID NO: 


10630 


(367) 


R2 


1 


1 


SEQ 


ID NO: 


10631 


(666) 


R3 


2 


1 


SEQ 


XD NO: 


10632 


(464) 


R4 


3 


1 


SEQ 


ID NO: 


10633 


(669) 


R5 


3 


1 


SEQ 


XD NO: 


10634 


(750) 


R6 


4 


1 


SEQ 


ID NO: 


10635 


(720) 


R7 


4 


1 


SEQ 


ID NO: 


10636 


(465) 


R8 


4 


1 


SEQ 


ID NO: 


10637 


(370) 


R9 


4 


1 


SEQ 


ID NO: 


10638 


(668) 


RIO 


4 


1 


SEQ 


ID NO: 


10639 


(135) 


Rll 


5 


1 


SEQ 


ID NO: 


10640 


(901) 


R12 


5 


1 


SEQ 


ID NO: 


10641 


(667) 


R13 


6 


1 


SEQ 


ID NO: 


10642 


(609) 


R14 


6 


1 


SEQ 


ID NO: 


10643 


(464) 


R15 


6 


1 


SEQ 


ID NO: 


10644 


(665) 


R16 


6 


1 


SEQ 


ID NO: 


10645 


(486) 


R17 


6 


1 


SEQ 


ID NO: 


10646 


(356) 


R18 


6 


1 


SEQ 


ID NO: 


10647 


(758) 


R19 


7 


1 


SEQ 


ID NO: 


10648 


(366) 


R20 


7 


1 


SEQ 


ID NO: 


10649 


(368) 


R21 


7 


1 


SEQ 


ID NO: 


10650 


(136) 


R22 


7 


1 


SEQ 


ID NO: 


10651 


(675) 


R23 


7 


1 


SEQ 


ID NO: 


10652 


(366) 


R24 


7 


1 


SEQ 


ID NO: 


10653 


(608) 


R25 . 


7 


1 


SEQ 


ID NO': 


10654 


(884) 


R26 


7 


1 


SEQ 


ID NO: 


10655 


(120) 


R27 


8 


1 


SEQ 


ID NO: 


10656 


(355) 


R28 


8 


1 


SEQ 


ID NO: 


10657 


(671) 


R29 


8 


1 


SEQ 


ID NO: 


10658 


(756) 


R30 


8 


1 


SEQ 


ID NO: 


10659 


(751) 


R31 


8 


1 


SEQ 


ID NO: 


10660 


(666) 


R32 


9 


1 


SEQ 


ID NO: 


10661 


(242) 


R33 


9 


1 


SEQ 


ID NO: 


10662 


(543) 


R34 . 


9 


1 


SEQ 


ID NO: 


10663 


(724) 


R35 


9 


1 


SEQ 


ID NO : 


10664 


(482) 


R36 


10 


1 


SEQ 


ID NO : 


10665 


(121) 


R37 


10 


1 


SEQ 


ID NO": 


10666 


(662) 


R38 


10 


1 


SEQ 


ID NO: 


10667 


(750) 


R39 


10 


1 


SEQ 


ID NO: 


10668 


(719) 


R40 


10 


1 


SEQ 


ID NO: 


10669 


(242) 


R41 


11 


1 


SEQ 


ID NO: 


10670 


(484) 


R42 


11 


1 


SEQ 


ID NO: 


10671 


(375) 


R43 


11 


1 


SEQ 


ID NO: 


10672 


(728) 


R44 


11 


1 


SEQ 


ID NO: 


10673 


(373) 


R45 


11 


1 


SEQ 


ID NO: 


10674 


(998) 


R46 


11 


1 


SEQ 


ID NO: 


10675 


(486) 


R47 


12 


1 


SEQ 


ID NO: 


10676 


(881) 


R48 


12 


1 


SEQ 


ID NO: 


10677 


(882) 


R49 


12 


1 


SEQ 


ID NO: 


10678 


(244) 


R50 


12 


1 


SEQ 
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TABLE 34 



Compound 

# 


Structure 


Name 


MH+ 


1 




N-methyM-[(2-{[2-(1- 
methylethyl)phenyI]amino}-1 H- 
benzimidazol-5-y0oxy]pyridine- 
2-carboxamide 


402.5 










2 


H 


NnTiethyr-4-{[1 -methyl-2-({3- 
;(tiimethylsilyl)8thynyl]phenyl}am 
ino)-1H-benzimidazoI-5- 
ylloxy}pyridine-2-cartx)xamide 


470.6 


3 


Accra**" 4 


N-m8thyM-[(1-methyl-2-{[2- 
(phenylcarbonyl)phenyl]amino}- 
1H-benzimidazol-5- 
yl)oxy]pyridine-2-carboxamide 


478.5 


4 


o 


4-(methyloxy)-N-[6-(methyIoxy)- 
1 ,3-benzothiazol-2-yl]-3- 
nitrobenzamide 


360.4 


I 5 




4-({2-[(4-butylphenyl)amino]-1 ,3- 

benzothiazol-5-yl}oxy)-N- 

methyIpyridine-2-carboxamide 


433.5 


6 ! 




IM-methyl-4-({1 -methyl-2-[(6- 
pynrolidin-1 -^pyridin-3-yl)amino]- 
1 H-benzimidazol-5- 
yl)oxy)pYridine-2-carboxamide 


444.5 


7 




4-({2«[1 JMfri(cyclohexyl)-2- 
ylamino]-1 -methyl-1 H- 
senzim idazol-5-yI}oxy)-N- 
methylpyridine-2-carboxamide 


462.6 


8 


H^C 


4-({2-[(4-chlorophenyl)amlno]-1- 
methyl-1 H-benzimidazol-5- 
yl}oxy)-N-1,3-thiazol-2- 
ylpyridine-2-carboxamide 


477.9 



-483- 
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9 




4-{(1-methyl-2-fl2- 
(methy!oxy)phenyl]amino}-1 H- 
benzimidazol-5-yl)oxy]-N^3- 
(methyloxy)propy!foyridIne-2- 
carboxamide 


462.5 


10 




4-({2-[(4^thyipheny!)amino]-1 ,3- 

benzoxazol-5-yI}oxy)-N- 

methylpyrid]ne-2-cartx>xamide 


389.4 


11 




1 -[(3-f luorophenyI)carbonyI]-4- 
([4- 

(trffluoromethyOphenyllmethyfJpi 
perazine 


367.4 


12 




1 -[2-(ethytoxy)phenylW[3 A5- 
bis(methyloxy)phenyl]cart)onyI}p 
] perazine 


401.5 


13 


CH, 


1 -(3-chlorophenyl)-4-{[2- 

[ethyloxy)phenyi]carbony)}pipera 

zine 


345.8 


14 




3-({4-[(2E)-3-phenylprop-2- 
eny!]piperazin-1 -yl}carbonyl)-7- 
oxabicyclo[2.2.1]heptane-2- 
carboxylic acid 


371.4 


15 




1 -[2-(methyloxy)phenyQ-4- 
{[3,4,5- 

ris(methyloxy)phenyl]carbonyl}p 
perazine 


387.4 


16 


0 


3-[(4-pyridin-2-ylpiperazin-1 - 
yl)carbonyl]-7- 
oxabicyclo[2.2.1]heptane-2- 
carboxyiic acid 


332.4 



^84- 
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17 




3-pentyl-7-[(4-phenylpipera2ln-1 - 
y1)rarbonylj-24hioxo-2,3- 
dihydroquinazolin-4(1 H)-one 


437.6 


18 




W(E)-({4-[(2A 

dimethylphenyl)methyl]piperazin 
-1 -yf}imino)methyl]naphthaJen-2- 

Dl 


374.5 


19 




5~chloro-1-{[3- 

(trifluoromethy0phenyl]methyl}- 
1H-indole-2,3-dione 


340.7 


20 




1 -[(4-methylphenyI)methyrh5- 
nitro-1 H-indole-2,3-dione 


297.3 


21 


?H» 9K3O 

CH, 


1 -methyl-6,7-bis(methyloxy)-2- 
[[3-(methy1oxy)phenyl]carbonyl}- 
1 ,2,3,4-tetrahydroisoquinoline 


342.4 


22 


CCrSSbcT 

CH, 


1 -methyl-6,7-bis(methyloxy)-2- 
naphthalen-2-ylcarbonyl)- 
1 ,2,3,4-tetrahydroisoquinoline 


362.4 


23 




2- (trifiuoromethyl)phenylJmethyl 

3- [4-(aminocartx>ny])pheny1]-2- 
cycloheptyl-1 -oxo-1 ,2,3,4- 
tetrahydroisoquinoline-4- 
carboxylate 


565.6 


24 


0 


anthra[1 ,2-c][1 ,2,5]thiadiazole- 
8,11-dione 


267.3 



-485- 
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25 






benzorbloxanthrene-6,1 1 -dlone 


265.2 


26 


oil): 


o 


ethyl 6,1 1-dioxo-6,11- 

dihydrobenzo[b]phenazine-2- 

carboxylate 


333.3 


27 




N,N-dimethyl-9,1 O-dioxo-9,10- 
dihydroanthracene-1 - 
sulfonamide 


316.3 


28 






2-(trifluoromethyI)-3-{[3 l 4 l 5- 

tris(methyloxy)phenyqcarbonyi}n 

aphtho[2 f 3-blfijran-4,9-dione 


461.4 


29 


o 




2-(2-oxopropyl)-2-phenyM H- 
indene-1 , 3(2H)-dione 


279.3 


30 




ethyl 4-{5-[(3- 

nitrophenyi)carbonyl]-1 ,3-dioxo- 
1 f 3-dihydro-2H-isoindol-2- 
yllbenzoate 


445.4 


31 


o a 


5,6-dlchIoro-2-[2-ch!oro-5- 
(trif luoromethy1)phenyl]-1 H- 
isoindole-1 ,3(2H)-dione 


395.6 


32 






3-bromo-4~{[(2- 
Huorophenyl)methy0oxy}-5- 
(methyloxy)benzaldehyde 
thiosemicarbazone 


413.3 



-486- 
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33 




2-{4-(3-chlorophenyl)pipera2in-1 

ylpS-nitrobenzaldehyde 

thiosemicarbazone 



419.9 



34 




<H[2-(3- 

chlorophenyl)ethyl]arnino}-3- 

nitrobenzaldehyde 

thiosemicaibazone 



378.9 



35 




(1 E)-6,9-dimethyl-2,3,4 f 9- 
tetrahydro-1 H-carbazoM -one 
thiosemicarbazone 



287.4 



36 




252.4 



(2E)-1 ,1 '-bi(cyclohexan)-1 -en-2- 
one thiosemicarbazone 



37 



H 



4-{[2-(4- 

chlorophenyt)ethyl]amino}-3- 

nitrobenzaldehyde 

thiosemicarbazone 



378.9 



38 



y-xi 




4~(diethylamino)-2-{[(4- 
fluorophenyl)methyl]oxy}benzald 
ehyde IM-(2-piperidin-1- 
ylethyflthiosemicarbazone 



486.7 



39 




PH. 



b-CH, 



3,4-bis(methyloxy)benza)dehyde 
(1 ,1 -dioxido-1 ,2-benzisothiazok- 
3-yl)(methyl)hydrazone 



360.4 



40 



I 




(2E)-2-[(4-chlorophenyl)(5- 
chlorothien-2- 

yl)methylidene]hydrazinecarboxi 
midamide 



314.2 



^87- 



WO 2004/092360 



PCT/US2004/011710 



41 




2-(4-amino-2-oxo-1-propyH ,2- 
dihydroquinolin-3-yl)~1 H- 
benzimidazole-6-carbonitrile 


344.4 


42 




4-amino-6-fluoro-7-<{[4- 
;methyloxy)phenyl]methyI}amino 
-3-[5-(4-methylpiperazin-1 -yl)- 
1 H-benzimidazol-2-yl]quinolin- 
2(1H)-one 


528.6 


43 


YlV II 


6-chloro-3-(5-chloro-1 H- 
benzimidazol-2-yl)-4-{[2- 
dirnethylaiTiino)ethyl]amino}quin 
olin-2(1H)-one 


417.3 


44 




4-amino-5-(1 H-benzimidazol-2- 
yO-1 -methyi-1 ,7-dihydro-6H- 
)yrazolo[3,4-b]pyridin-6-one 


281.3 


45 


ifr l2 c ^ 

i - ii 

o o 


5,5-dimethyl-4-methylidene-3- 
2,4,S-trinitrophenyl)-1 ,3- 
oxazolidin-2-one 


339.2 


46 


o 


5-methyl-2-[4- 

;methyloxy)phenyl]hexahydro- 
1 H-isoindole-1 ,3(2H)-dione 


274.3 


47 


• 


5-methyl-2-(4- 

methylphenyl)hexahydro~1 H- 
isoindole-1 ,3(2H)-dione 


258.3 


48 


N^NH 


N~2~-(4-chlorophenyt)-6,6- 
dimethyH ,6-dihydro-1 ,3,5- 
triazine-2,4-diamine 


252.7 



-488- 
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49 




7Z)-7-(furan-2-ylmethylidene)-3- 

DhenyW.^ihydro-ZH- 

1 ,3lthiazolo[3£-a][1 ^rSltriazin- 

3{7H)-one 


312.4 


50 




;3aR,9R,9aR)-6 f 7-dihydroxy-9- 

3,4,5-tris(methyloxy)phenyq- 

3a,4,9,9a- 

ietrahydit>naphtho[2,3-c]furan- 
1(3H)-one 


387.4 


51 




6-chloro-2-(ethyloxy)-4-methyl-3" 
;4-nftrophenyl)-3a,4 f 9,9a- 
etrahydro-3H-pyrrolo[2,3- 
blquinoxaline 


387.8 


52 


CH, >=0 
u <-» 


ethyl 2-(ethyloxy)-4-methyl- 
3a,4,9,9a-tetrahydro-3H- 
pyrrolo[2,3-b]quinoxaline-3- 
carboxylate 


304.4 


53 


<npcH, 


ethyl 4-({[2,5- 

bis(methytoxy)phenyl]amino}met 
hyl)-3,5-dimethyM H-pyrrole-2- 
carboxylate 


333.4 


54 




1 ^3-[(6-amino-5-nitropyridin-2- 
yl)amino]propyl}-4-(2- 
;hlorophenyl)-N-[(2S)-2- 
hydroxypropyl]-1 H-pyrrole-3- 
carboxamide 


473.9 


55 




(4-methylphenyl)(5-nitro~2- 
3iperidin-1 -ylphenyOmethanone 


325.4 


56 


Ob 


(2S,5R)-N~1 ~(4-methylphenyl)- 
5-phenyl-N~2~-(2-pyridin-2- 
ylethyl)pyrrolidine-1 ,2- 
dicarboxamide 


429.5 



^89- 
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57 




2-l(3S)-3-(acetylamino)-2- 
oxopyrroIidin-1 -yl}-N-[2-(4- 
Ruorophenyl)ethyl]acetarnide 


322.4 


58 




N-[2-(2,4-dichlorophenyf)ethyl]- 
4-«(Z)-[(4,4- 

dtf!uorocyclohexy0imino][(3S)-3- 

methylpiperazin-1- 

ynmethyl}amino)benzamide 


553.5 


59 




4-[4-(methyloxy)phenyl]-5- 
phenylisoxazole 


252.3 


60 




methyl 4^[4-(1-methylethyl)-2,3- 
dioxo-7-(trifluoromethyl)-3,4- 
dihydroquinoxalin-1 (2H)- 
yflmethyllbenzoate 


421.4 


61 




(3beta,16beta)-3,14,16- 
trihydroxybufa-20,22-dienolide 


403.5 


62 


0 


2-(aminomethyl)-1 -(2-pyridin-2- 
ylethyl)quinazolin-4(1 Hy-one 


281.3 


63 


p4-f 


Bthyl 4-{[5-[3,4- 
bis(methyloxy)phenyl]-7- 
(trifluoromethyi)pyrazolo[1 ,5- 
a]pyrimidin-3- 
yfjcarbonyl}piperazine-1 - 
carboxylate 


508.5 


64 


F ~r F 


5-[3,4-bis(methyloxy)phenyl]-3- 
(piperidin-1 -ylcarbonyI)-7- 
(trifluoromethyl)pyrazolo[1 ,5- 
alpyrimidine 


435.4 



^90- 
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65 




5-p,4-bis(methytoxy)phenyO-N- 
rnethyWs*-(2i)yridiiv2-y!ethyf)-7- 
(trifluoromethyl)pyrazolo[1 ,5- 
alpyrimidine-2-cartx)xamide 


486.5 


66 


OH 


5-propyl-2-thien-2- 
ylpyrazotofl ,5-a]pyrimidin-7-ol 


260.3 



■491- 
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Table 35 





Structure ! 


Source 


Literature Reference 


Patent Number 
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^«i> .hip 

s -^o 


Avenfis Pasteur 


1) Lang, J.-M.;TouraJne, 
J.-L;Trepo,C.etal 
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ES 8602792 


3 
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4 


H3C^ChL 
H 3 C^CH3 
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5 
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HO, J 

HO 1 ^ 
< 
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Fujisawa 
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9 
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H3C ^^^^ O CH 3 
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13 
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14 
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BRIEF DESCRIPTION OF SEQUENCE LISTING 



SEQ ID NO: 


Description 


1 


Draft genome assembly from The Genome Science Center in British Colombia, 
Canada of sequence from TOR2 isolate, T0R2 drtfijienome jissembb 120403 Release 1 


2 


CDC SARS-CoV strain sequence. Entire nucleotide sequence (Urbani strain) 


3-20 


Group-specific corona virus gene products 

> Feline infectious peritonitis virus (FEPV) 

3/4 = ORF 3b; 5/6 = ORF 3X; 7/8 = ORF 3A 

> Canine corona virus 

9/10 = ORF 7b; 11/12 = ORF 7a 

> Avian infectious bronchitis virus 

13/14 = ORF 5b; 15/16 = ORF 5a; 17/18 = ORF 3a; 19/20 = ORF 3b 


/>i con 

21-520 


500 primers for left part 


521-1020 


500 primers for right part 


i rv^ i o con 

1021-3520 


Forward primers from Table 4 


3521-6020 


Reverse primers from Table 4 


6021-6026 


Figure 9 primers 


6027-6033 


Figure 11 primers 


6034-6038 


Five primers from http://corite7itnejm.org/cgi/repruit/NEJM 


6039-6051 


PEP1 toPEP13 


/AC A 

5052 


extended Pnjfl3 


6053-6056 


229E human coronavirus sequences 


6057-6060 


TGV sequences 


6061-6064 


PliD V sequences 


6065-6068 


Bovine coronavirus sequences 


o0o9-6U71 


Murine hepatitis virus sequences 


6072-6075 


AJLBV sequences « . 


6076-6 170 


Primer sequences (forward) 


6171-6265 


Primer sequences (reverse) 


6266-6304 


Primer sequences (forward) 


6305-6343 


Primer sequences (reverse) 


6344-6366 


Primer sequences (forward) 


6367-6392 


Primer sequences (reverse) 


6393-6440 


Primer sequences (forward) F1-F48 


6441-6487 


Primer sequences (reverse) R1-R47 1 


6488-6559 


Primer sequences 


6560-6568 


Primer sequences 


6569 


The nsp2 proteinase (3CL-PRO) sequence in SARS coronavirus 


OO/O- /z 


The nsp2 proteinases (3CLp) of avian IB V, MHV, and BCoV 


oj/i 


Consensus nsp2 proteinases sequence j 


CO/4-OD/7 


IG sequences from Figure 18 


oj /o 


Expression construct of nSh in pCMVllI 




Expression construct of nS in pCMVm 


03 oU 


Expression construct of nSh ATC in pCMVllI 


COol 


Expression construct of nS ATC in pCMVILT 


0jo2 


Expression construct of nSlh in pCMVIU 


Ojo3 


Expression construct of nSl in pCMVllI 




Primers for cDNA amplification 


OJOJ-ODO/ 


Primers for RT-PCR 


6588-6809 


\^\jmy\JU\iiii, od|UdlwCo Ul JTlgUXC £,D amilK) aClQS I 


6810-7179 


Component sequences of Figure 24 (>4 amino acids) 


7180-7187 


N-gJycosylation sites within SEQ ID NO: 6039 


7188-7189 


Component sequences of Figure 25 


7190 


Fragment of SEQ ID NO: 7188 


7191 


Polynucleotide encoding SEQ ID NO: 7190 


7192 


Amino acids 879-1005 of SEQ ID NO: 6042 


7193 


Amino acids 879-980 of SEQ ID NO: 6042 
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7194 


Amino acids 901-1005 ofSEQ ID NO: 6042 


7195 


Amino acids 1144-1201 ofSEQ ID NO: 6042 


7196 


Amino acids 1144-1196 of SEQ ID NO: 6042 


7197-7199 


Membrane fusion peptide regions 


7200-7206 


NadA-based polypeptides 


7207-7223 


N-glycosvIation sites within SEQ ID NO: 6042 


7224-7231 


Slippage region 


7232 


Orflab polyprotein 


7233-7244 


Orflab poiyproteins 


7245-7247 


X 2 sequences for SEQ ID NOS 7233-7244 


7248-7253 


Orflab poiyproteins 


7254 


Zinc bmding region 2 site 


7255-7271 


N-glycosylation sites in SEQ ID NOS: 6040-41,6043.604546,6050-51 


7272-7291 


Polypeptides and polynucleotides 


7292-7293 


Intergenic sequences 


7294-7301 


Nucleotides from 5' end of SARSV genome followed bv intergenic sequence 


7302-7306 


NadA constructs 


7307-7308 


Fragments of SEQ ID NO: 6042 


7309 


NadA sequence 


7310-7311 


NadA leader sequences 


7312-7315 


Amino acid sequencess from NadA 


7316-7324 


PCR primers 


7325-7330 


Primers 


7331 


CCACC sequence 


7332-7336 


3' IriK forward primers 


I 7337-7341 


3' Tj IK reverse primers 


7342-7352 


3* UTR probes 


7353-7362 


5' DTK forward primers 


7363-7373 


5* Tj tk reverse pnmers 1 


7374-7385 


5' UTR probes 


7386 


Conserved octanucleoode 


7387 


Reverse complement of SEQ ID NO: 7293 


7388 


Intergenic sequence t 


7389 


PolyT 


7390 


Stem-loop sequence 


7391-7392 


Poly-glycine linkers 


7393 


Poly-histidine tag 


7394 


Nucleocapsid epitope site 


7395 


Antisense primer i 


7396-7397 


Probes 


7398-7399 


Antigenic fragments of SEQ ID NO: 6042 


7400-7639 


T-epitope analysis of SEQ ID NO: 6039 


7640-7800 


T-epitope analysis of SEQ ID NO: 6040 


7801-8040 


T-epitope analysis of SEQ ID NO: 6041 


8041-8280 


T-epitope analysis of SEQ ID NO: 6042 


8281-8486 


T-epitope analysis of SEQ ID NO: 6043 


8487-8665 


T-epitope analysis of SEQ ID NO: 6044 


8666-8820 


T-epitope analysis of SEQ ID NO: 6045 


8821-9018 


T-epitope analysis of SEQ ID NO: 6046 


9019-9131 


T-epitope analysis of SEQ ID NO: 6047 


i 9132-9308 


T-epitope analysis of SEQ ID NO: 6048 




x -epitope analysis oi ID NO: 6049 


9438-9538 


T-epitope analysis of SEQ ID NO: 6050 


9539-9752 


T-epitope analysis of SEQ ID NO: 6052 


9753-9763 


Primers for spike protein amplification, particularly fragments of spike 


9764-9765 


N-glycosylation sites within SEQ ID NO: 6039 


9766-9779 


Cleavage products for ORFlab (Table 10) 
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9780-9782 


Forward primer, reverse primer, probe 


9783-9784 


Ly sine-rich region 


9785-9798 


Oligonucleotides used for S.cerevisiae expression 


9799-9802 


Sequences from Figures 65 & 66 


9803-9882 


Primers for Kcoli cloning 


9883-9885 


BCV nucleotide sequences for Figures 3A, 3B, 3C j 


9886-9891 


BCV amino acid sequences for Figures 4 A, 4B, 4C, 4D, 4E, 4F 


9892 


BCV 5*UTR 


9893 


BCV 3'UTR 


9894-9896 


MHV nucleotide sequences for Figures 3 A, 3B, 3C 


9897-9902 


MHV amino acid sequences for Figures 4A, 4B, 4C, 4D, 4E, 4F 


9903-9904 


AIB V nucleotide sequences for Figures 3A, 3B 


9905-9909 


AIBV amino acid sequences for Figures 4A, 4B, 4D, 4B, 4F 


9910 


AIBVS'UTR 


9911 


AIBV 3'UTR 


! 9912-9913 


HOBMFRO, HOBHEGA nucleotide sequences for Figures 3B, 3C 


9914-9918 


Human Co V amino acid sequences for Figures 4 A, 4B, 4C, 4E, 4F 


9919 i 


HCoV-OC43 5'UTR 


9920 


HCov-oc43 yum 


9921-9923 


pCMVKm2 vectors 


9924-9926 


Codon-optunised N, M and E sequences 


9927 


BNI-1 


9928-9959 


Constituent arnino acid sequences >4aa inferred from SEQ ID NO: 9927 


9960 


ORFlab variant 


9961 


\jjxrxa vanani 


9962 


Spike variant 


9963 


Membrane variant 


9964 


Nucleocapsid variant 


9965-9966 


Short ORFs 


9967 


FkA complete genome 
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CLAIMS 

1 . An isolated polypeptide of the SARS vims. 

2. Hie polypeptide of claim 1, wherein the polypeptide is a Spike (S) polypeptide, an Env 
(E) polypeptide, a Membrane (M) polypeptide, a hemagglutinin-esterase polypeptide (HE), a 

5 nucleocapsid (N) polypeptide, a ORFla polypeptide, a ORFlab polypeptide, a proteolytic 
fragment of a ORFla polypeptide, or a proteolytic fragment of a ORFlab polypeptide. 

3. The polypeptide of claim 1, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO 8 : 6039, 7232, 9766, 9767, 9768, 9769, 9770, 
9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 6042, 6043, 6044, 6045, 6046, 6047, 

10 6048, 6049, 6050 or 6052. 

4. The polypeptide of claim 1, wherein the polypeptide comprises an amino acid sequence 
having >75% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ID NO s : 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 
9769, 9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 

15 10338, 10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 
10577, 10578, 10579, 11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 

5. The polypeptide of claim 1, wherein the polypeptide comprises a fragment of at least 10 
consecutive amino acids of an amino acid sequence selected from the group consisting of SEQ 
ID NO s : 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 

20 9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338, 
10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 
10578, 10579, 11552, 11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 

6. A polypeptide comprising an amino acid sequence having >80% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO s : 6042, 6043, 6044, 

25 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 9770, 9771, 9772, 9773, 
9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338, 10339, 10340, 10341, 
10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 10578, 10579, 11552, 
11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 

7. A polypeptide comprising an amino acid sequence that comprises a fragment of at least 
30 10 consecutive amino acids of an amino acid sequence selected from the group consisting SEQ 

ID NO s : 6042, 6043, 6044, 6045, 6046, 6047, 6048, 6049, 6050, 6052, 9766, 9767, 9768, 9769, 
9770, 9771, 9772, 9773, 9774, 9775, 9776, 9777, 9778, 9779, 9997, 9998, 10149, 10316, 10338, 
10339, 10340, 10341, 10342, 10532, 10533, 10571, 10572, 10573, 10574, 10575, 10576, 10577, 
10578, 10579, 11552, 11561, 11562, 11618, 11619, 11620, 11627, 11630, 11633 & 11636. 
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8. A polypeptide comprising an amino arid sequence having >80% sequence identity to 
SEQ ID NO: 6042, and/or comprising an amino acid sequence that comprises a fragment of at 
least 10 consecutive amino acids of SEQ ID NO: 6042, wherein the polypeptide is in the form of 
a trimer. 

5 9. Nucleic acid encoding the polypeptide of any one of claims 1 to 8. 

10. Nucleic arid according to claim 9, comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NO s : 7191, 7273, 7275, 7277, 7279, 7281, 7283, 7285, 7287, 7289, 
7291, 7292, 7293, 9968, 10066, 10084, 10299, 10505, 11323, 11563, 11639 & 11640. 

11. A polynucleotide comprising a nucleotide sequence having >80% sequence identity to 
10 the nucleic acid of claim 9 or claim 10. 

12. A polynucleotide comprising a fragment of at least 10 consecutive nucleotides of the 
nucleic arid of claim 9 or claim 10. 

13. Antibody that recognizes the polypeptide of any one of claim 1 to 8. 

14. The antibody of claim 13, wherein said antibody recognizes the polypeptide comprising 
15 the amino acid sequence of SEQ ID NO: 6042 or a fragment thereof. 

15. Hie antibody of claim 14, wherein said antibody recognizes the polypeptide comprising 
the amino arid sequence of SEQ ID NO: 6042 or a fragment thereof in trimeric form. 

16. The antibody of claim 13, wherein the antibody is a monoclonal antibody, 

17. The antibody of claim 13, wherein the. antibody is a human antibody, 

20 18. An immunoassay for detecting a SARS virus antigen in a sample, comprising the step of 
contacting the sample with the antibody of any one of claims 13 to 17. 

19. An immunoassay for detecting an antibody against a SARS virus antigen in a sample, 
comprising the step of contacting the sample with the polypeptide of any one of claims 1 to 8. 

20. A method of detecting an antibody against a SARS virus antigen in a sample comprising 
25 contacting said sample with the polypeptide of any one of claims 1 to 8, under conditions 

suitable for binding said polypeptide to said antibody, if present, and detecting the binding of 
said polypeptide to said antibody. 

21. A method for detecting a SARS virus antigen in a sample comprising contacting said 
sample with the antibody of any one of claims 13 to 17, under conditions suitable for binding 

30 said antibody to said antigen, if present, and detecting the binding of said antibody to said 
antigen. 
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22. A vaccine for the treatment or prevention of severe acute respiratory syndrome (SARS), 
comprising an inactivated SARS vims, a lolled SARS virus, an attenuated SARS virus, a split 
SARS virus preparation, or at least one purified SARS virus antigens. 

23. Hie vaccine of claim 22, comprising a purified polypeptide according to any one of 
5 claims 1 to 8. 

24. The vaccine of claim 22 or claim 23, wherein the antigen is a purified SARS virus 
antigen in the form of a VLP. 

25. Hie vaccine of any one of claims 22 to 24, further comprising an adjuvant 

26. The vaccine of claim 25, wherein the adjuvant is an aluminium salt or is MF59. 

10 27. The vaccine of any one of claims 22 to 26, comprising more than one SARS virus 
antigen. 

28. The vaccine of claim 27, wherein the antigens are selected from S, E, N and M. 

29. The vaccine of claim 22, comprising an inactivated SARS virus. 

30. The vaccine of claim 29, wherein said virus is inactivated by chemical or physical means. 
15 31. The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 

an effective amount of one or more of the following agents selected from the group consisting of 
detergents, formaldehyde, formalin, P-propriolactone, and UV light. 

32. The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 
an effective amount of one or more of the following agents selected from the group consisting of 

20 methylene blue, psoralen and carboxyfullerene (C60). 

33. The vaccine of claim 30, wherein said inactivation comprises treatment of the virus with 
an effective amount of one or more of the following agents selected from the group consisting of 
binary ethylamine, acetyl ethyleneimine and gamma irradiation. 

34. The vaccine of claim 31, wherein said inactivation comprises treatment with P- 
15 propriolactone. 

35. The vaccine of claim 34, wherein said ^-propriolactone is used at a concentration of 0.01 
to 0.5%. 

36. Hie vaccine of claim 34, wherein said p-propriolactone is used at a concentration of 0.5 
to 0.2%. 

JO 37. Hie vaccine of claim 34, wherein said P-propriolactone is used at a concentration of 
0.025 to 0.1%. 
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38. A method of inactivating SARS virus comprising exposing the virus to an inactivation 
agent for 12 to 24 hours at refrigeration temperatures followed hydrolysis of any residual 
inactivating agent by elevating the temperature for three hours. 

39. Hie method of claim 38, wherein die inactivation agent is 0-propriolactone. 

40. Hie method of claim 38, wherein the refrigeration temperature is between 0°C and 8°C. 

41 . The method of claim 38, wherein the elevated temperature is between 33°C and 41°C. 

42. A method for making an inactivated SARS vaccine comprising: 

a. innoculating a mammalian cell culture with SARS virus; 

b. cultivating the infected cells; 

c. harvesting SARS virus containing supernatant; 

d. inactivating the SARS virus; and 

e. purifying the inactivated SARS virus. 

43. The method of claim 42, wherein said mammalian cell culture is derived from one or 
more of the cell types selected from the group consisting of fibroblast cells, endothelial cells, 
hepatocytes, keratinocytes, immune cells, mammary cells, smooth muscle cells, melanocyte 
cells, neural cells, prostate cells, renal cells, skeletal cells, liver cells, retinoblast cells and 
stromal cells. 

44. The method of claim 42, wherein said mammalian cell culture is derived from a cell 
culture selected from the group consisting of human cells, non-human primate cells, HeLa cells, 
human diploid cells, fetal rhesus lung cells, human embryonic kidney cells, VERO cells, horse 
cells, cow cells, sheep cells, dog cells, cat cells or rodent cells. 

45. The method of claim 42, wherein said mammalian cell culture is derived from VERO 
cells or fetal rhesus kidney cells. 

46. The method of claim 42, wherein said mammalian cells are cultured in serum free media. 

47. The method of claim 42, wherein said mammalian cells are cultured in protein free 
media. 

48. The method of claim 42, wherein said inoculating step comprising absorbing the SARS 
virus onto the cell culture for 60 to 300 minutes. 

49. The method of claim 42, wherein said inoculating step is conducted at 25°C to 40°C. 

50. The method of claim 42, wherein said purification step comprises one or more of the 
treatments selected from the group consisting of gradient centrifugation, ultracentrifugation, 
continuous-flow ultracentrifugation, chromatography, polyethylene glycol precipitation, and 
ammonium sulfate precipitation. 
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51. Hie method of claim 42, wherein said purification step comprises one or more of the 
treatments selected from the group consisting of ultrafiltration and dialfiltration. 

52. The method of claim 50, wherein said chromatography treatment includes one or more of 
the chromatography treatments selected from the group consisting of ion exchange 

5 chromatography, size exclusion chromatography, and liquid affinity chromatography. 

53. The method of claim 52, wherein said chromatography treatment includes use of one 
more chromatographic resins selected from the group consisting of an an anionic resin and a 
cationic resin. 

54. The method of claim 52, wherein the ion exchange chromatography treatment includes a 
10 first step using a strong anion exchange resin and a second step using a strong cation exchange 

resin. 

55. The method of claim 50, wherein said gradient centrifugation purification step comprises 
density gradient centrifugation. 

56. Hie method of claim 42, wherein said purification step comprises a first step of 
15 chromatography purification and a second step of gradient centrifugation. 

57. The method of claim 56, wherein said first chromatography purification step comprises 
liquid affinity chromatography. 

58. The method of claim 56, wherein said second gradient centrifugation step comprises 
density gradient centrifugation. 

20 59. A single-stranded oligonucleotide comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOS: 21-6020, 6076-6568, 6586-6587, 7292-7301, 7325-7328, 
7332-7352, 7353-7385, 10235-10298, 10352-10504, 10580-11322 and 11325-11551. 

60. A single-stranded oligonucleotide comprising the complement of the oligonucleotide of 
claim 59. 

25 61. Hie oligonucleotide of claim 59 or claim 60, comprising 10-30 nucleotides. 

62. The oligonucleotide of claim 61, comprising the nucleotide sequence of SEQ ID NO: 
7292, SEQ ID NO: 7293, the complement of SEQ ID NO: 7292 or the complement of SEQ ID 
NO: 7293. 

63. A kit comprising primers for amplifying a template sequence contained within a SARS 
30 virus nucleic acid target, the kit comprising a first primer and a second primer, wherein the first 

primer comprises a sequence substantially complementary to a portion of said template sequence 
and the second primer comprises a sequence substantially complementary to a portion of the 
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complement of said template sequence, wherein the sequences within said primers which have 
substantial complementarity define the termini of the template sequence to be amplified. 

64. Hie kit of claim 63, wherein the template sequence is contained within SEQ ID NO: 1 
and/or SEQ ID NO: 2. 

5 65. The kit of claim 63 or claim 64, wherein the first primer comprises a ftagment of 8 or 
more nucleotides of SEQ ID NO: 1, and the second primer comprises a fragment of 8 or more 
nucleotides of the complement of SEQ ID NO: 1. 

66. The kit of claim 63 or claim 64, wherein the first primer comprises a fragment of 8 or 
more nucleotides of SEQ ID NO: 2, and the second primer comprises a fragment of 8 or more 

10 nucleotides of the complement of SEQ ID NO: 2. 

67. The kit of claim 63, wherein the first primer is an oligonucleotide according to any one of 
claims 59 to 62 and the second primer is an oligonucleotide according to any of claims 59 to 62. 

68. The kit of any one of claims 63 to 67, further comprising a labeled probe that comprises 
either a fragment of 8 or more nucleotides of SEQ ID NO: 1 and/or SEQ ID NO: 2, or th* 

15 complement of said fragment, which fragment is located within the template sequence. , 

69. The kit of any one of claims 63 to 68, wherein the first primer and/or the second primer 
comprises a nucleotide sequence selected from the group consisting of SEQ ID NOS: 21-6020, 
6076-6568, 6586-6587, 7292-7301, 7325-7328, 7332-7352, 7353-7385, 10235-10298, 10352- 
10504, 10580-11322 and 11325-11551. 

10 70. The kit of any one of claims 63 to 68, wherein the first primer and/or the second primer 
comprises the complement of a nucleotide sequence selected from the group consisting/of SEQ 
ID NOS: 21-6020, 6076-6568, 6586-6587, 7292-7301, 7325-7328, 7332-7352, 7353-7385, 
10235-10298, 10352-10504, 10580-11322 and 11325-11551. 

71. A method of detecting the presence of SARS virus in a sample comprising providing a 

:5 sample suspected of containing a SARS virus nucleic acid target, amplifying a template sequence 
contained within said SARS virus nucleic acid target with the kit of any one of claims 63 to 70, 
and detecting the amplified template sequence, wherein the presence of the amplified template 
sequence indicates the presence of SARS virus in said sample. 

72. The method of claim 71 , wherein said amplifying is accomplished using polymerase 
0 chain reaction, transcription mediated amplification, reverse transcription PCR, ligase chain 

reaction, strand displacement amplification or nucleic acid sequence-based amplification. 

73. A double-stranded RNA molecule with a length from about 10 to about 30 nucleotides 
which is able to inactivate the SARS coronavirus in a mammalian cell. 
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74. The double-stranded RNA of claim 73, wherein the sequence of one of the strands is at 
least 90% identical to a target sequence, wherein the target sequence is a fragment of SEQ ID 
NO: 1 and/or SEQ ID NO: 2, 

75. Hie double-stranded RNA of claim 73 or claim 74, wherein the target sequence 

5 comprising a nucleotide sequence selected from the group consisting of SEQ ID NOS: 7292, 
7293, 7294, 7295, 7296, 7297, 7298, 7299, 7300 and 7301. 

76. Hie double-stranded RNA of any one of claims 73 to 75, comprising at least one 
modified nucleotide. 

77. A method for treating a patient suffering from SARS, comprising: administering to the 
10 patient a therapeutically effective dose of a molecule of less than 1000 g/mol. 

78. The method of claim 77, wherein the molecule has an aromatic region and greater than 
one heteroatom selected from O, S, or N. 

79. A method for treating a patient suffering from SARS, comprising: administering to the 
patient a therapeutically effective dose of a compound selected from: a nucleoside analog, a 

15 . peptoid, an oligopeptide, a polypeptide a protease inhibitor, a 3C-like protease inhibitor , a 
papain-like protease inhibitor, or an inhibitor of an RNA dependent RNA polymerase. 

80. A method for treating a patient suffering from SARS, comprising: administering to the 
patient a steroidal anti-inflammatory drug in combination with at least one antiviral compound 

81. A method for treating a patient suffering from SARS, comprising: administering to the 
20 patient a therapeutically effective dose of a compound selected from: acyclovir, gancyclovir, 

vidarabidine, foscamet, cidofovir, amantidine, ribavirin, trifluorothymidine, zidovudine* • 
didanosine, zalcitabine, an antiviral compound listed in Table 1; an antiviral compound listed in 
Table 2; or an interferon. 

82. The method of claim 81, wherein the interferon is an interferon-a or an interferon-(J. 

25 83. The method of any one of claims 77 to 82, wherein the molecule or compound is 
delivered by inhalation. 

84. A method of identifying a therapeutically active agent comprising the steps of: (a) 
contacting a therapeutically active agent with a cell infected with the SARS virus; (b) measuring 
attenuation of a SARS related enzyme. 

JO 85. A viral vector or particle for in vivo delivery of a nucleic acid of claim 9 or claim 10. 

86. The viral vector of claim 85, wherein the vector is an adenovirus vector, a poxvirus 
vector or an alphavirus vector. 

87. An alphavirus replicon particle comprising one or more SARS viral antigens. 
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88. Hie replicon particle of claim 87, wherein said SARS viral antigen is a spike protein. 

89. The replicon particle of claim 87, wherein said particle comprises a replicon derived from 
Venezuelan Equine Encephalitis (VEE) and further comprises an envelope derived from Sindbus 
virus (SIN) or Semliki Forest Virus (SFV). 

5 90. A vaccine comprising one or more SARS virus antigens and one or more respiratory 
virus antigens. 

91. The vaccine of claim 90, wherein said respiratory virus antigens are selected from the 
group consisting of influenza virus, human rhinovirus (HRV), parainfluenza virus (PIV), 
respiratory syncytial virus (RSV), adenovirus, metapneumovirus, and rhinovirus. 

10 92. The vaccine of claim 91, wherein said respiratory virus antigen is from influenza virus. 

93. The vaccine of claim 90, wherein said respiratory virus antigen is from a coronavirus 
other than the SARS virus. 

94. A polypeptide comprising an immunogenic, surface exposed fragment of the amino acid 
sequence SEQ ID NO: 6042. 

15 95. The polypeptide of claim 94, wherein said fragment does not include the last 50 amino 
acids of the C-terminus of SEQ ID NO: 6042. 

96. The polypeptide of claim 94, wherein said fragment does not include a transdomain 
region of SEQ ID NO: 6042. 

97. The polypeptide of claim 94, wherein said fragment does not include a C-terminus 
20 cytoplasmic domain of SEQ ID NO: 6042. 

98. The polypeptide of claim 94, wherein said fragment does not include a N-terminus signal 
sequence. 

99. An isolated polynucleotide comprising a nucleic acid sequence selected from the group 
consisting of SEQ ID NOS: 9968 and 10066. 

25 100. The polynucleotide of claim 99, wherein the polynucleotide comprising a nucleic acid 
sequence having > 80% sequence identity to a polynucleotide sequence selected from the group 
consisting of SEQ ID NOS: 9968 and 10066. 

101 . An isolated polynucleotide comprising a fragment of at least 15 consecutive nucleic acids 
of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 9968 and 10066 

30 and wherein said fragment does not consist entirely of SEQ ID NO: 10033. 

102. An isolated polypeptide comprising an amino acid sequence encoded by any one of 
claims 99 - 101. 
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103. Hie polypeptide of claim 102, comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOS: 9969 - 10032, 10067, and 10015. 

104. Hie polypeptide of claim 103, wherein the amino arid sequence is selected from the 
group consisting of SEQ ID NOS: 9997, 9998 and 10015. 

5 105. An expression construct for recombinant expression of a SARS virus spike protein 
wherein said construct comprises a nucleic acid sequence selected from the group consisting of 
SEQ ID NOS: 6578 - 6583. 

106. A mammalian cell line stably expressing a SARS viral antigen. 

107. Hie cell line of claim 106, wherein said cell line is a Chinese Hamster Ovary (CHO) cell. 

10 108. The cell line of claim 106, wherein the SARS viral antigen is a spike protein or fragment 
thereof. 

109. The cell line of claim 106, wherein the spike protein is truncated to remove the 
transmembrane sequence. 

1 10. A method of identifying a therapeutically active agent comprising the steps of: (a) 
15 contacting a therapeutically active agent with a buffer comprising SARS enzyme; and (b) 

measuring attenuation of the SARS enzyme. 

111. The method of claim 110 wherein the SARS enzyme is a SARS protease. 

112. The method of claim 111 wherein the buffer further comprises a peptide with a SARS 
protease cleave site. 

20 113. The method of claim 1 10 wherein the measurement is made by the measurement of 
fluorescence. 

1 14. A vaccine of one of claims 22 to 37, and 90 to 93 further comprising an adjuvant. 

1 15. The vaccine of claim 1 14 wherein the adjuvant is a SMTP. 

1 16. The vaccine of claim 1 15 wherein the SMIP compound is selected from the group 
25 consisting of an acylpiperazine, a tryptanthrin, an indoledione, a tetrahydroisoquinoline, a 

benzocyclodione, an amino azavinyl compound, a thiosemicarbazone, a lactam, an 
aminobenzimidazole quinolinone, a hydropthalamide, a benzophenone, an isoxazole, a sterol, a 
quinazolinone, a pyrole, an anthraquinone, a quinoxaline, a triazine, an benzazole, and a 
pyimolopyrimidine, or a pharmaceutically acceptable salt, ester, or prodrug thereof. 

30 1 17. A method of vaccinating a subject comprising administering a vaccine of one of claims 
22 to 37, and 90 to 93. 

1 18. The method of claim 1 17 further comprising administering a SMIP. 
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119. A method for treating a patient of one of claims 77 to 82 further comprising 
administering at least one SMIP compound. 

120. A method for treating a patient of one of claims 77 to 82 further comprising 
administering at least one SMIS compound. 
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: -Section 8 

288 300 310 328 
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— —Section 10 
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- -Section 12 
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TGAGGATTACCATACACTGACGGTCACAATAATACGTGGTC 
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FIGURE 3B (contd.) 



" —Section 13 

(493) 483 500 J51Q 520 533 

hobmpro (490) %%^mw^^h^^^^^m^Mm 
bcvm (458) % < m^&tw§^m^^wmmm^m^ 

mhvm (461) m^^m^^^^m^^^^^^^m 

Avian infectious bmchlfis vims M (424) cgrfccfc- --- - ^c|k«M®3^^A^AGTfei^^: 
Consensus (493) AT C T T T A C AT G C AAGGT AT A AAG CT AGGTA C TGG CTATT C T 
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• Section 15 
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~ — Section 17 
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Consensus (657) GGT AAT T A C CG A CTGCC AT CAAC CCiGSAOG GTT C TGGCAT 
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SBQ ID NO: 9912 

SEQ ID NO: 9884 

SEQ ID NO: 9895 

SEQ ID NO: 9904 
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Section 7 
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HOBHEGA 
BCVHE 
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— : Section 13 
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Section 16 
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_i — _ Section 18 
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(872) ^^^tS^^^i^G^^^^^pSf f _ 

(521) fteG^^^ 

(902) TACAGGTTGTTGATTCGCGGTGGAACAATGCCAGGCAGTCTGATAACATGACQ 

—Section 19 
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BCVHE 
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(955) GC GTTGCTTGTCAACC CCGTACTGTTATTTTCGTAATTCTACTACCAATTA 
— . — —Section 20 
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HOBHEGA (978) 
BCVHE (978) 
MHVHE (627) 
Consensus (1008) 



1008 
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tAGjTGtf^^ 

TGTTGGTGTTT ATGATAT AATCATGG GATGCTGGTTTTACTAGCATAC 
_ Section 21 



(1061) 
HOBHEGA (1028) 
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Consensus (1061) 
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'MW^^^^'^^^M^^ * wrap* " — - r - r-— r~ 

^TC^QGT TT t &M^^^*^5^T^ T^CKf^A^^^Cg^^C^^^^^A^^CTT 
T AGTGGTTTGTTATATAAT^ 
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Section 22 
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(1114) 1114 .1120 .1130 ,1140 1150 



bcv he (1081) xmmfomiw&w& ^xm iQ^ „ „ , . „ 

MHV HE (733) 

Consensus (11 14) tatgataatgttagcagtgtcsggcctctctaccc TATGGCAGATGTCC AC 



• Section 23 
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■ Section 26 

(1273) 1273 

1 J7mT( 
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SEQ ID NO: 9913 
SEQ ID NO: 9885 
SEQ ID NO: 9896 
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FIGURE 4F (contd.) 



Section 19 
760 



human coronavirus OC43 S 
avianlnfedjousbronchitisvirusS 
bovine coronavfmsS 
mouse hepatitis virus S 
Consensus 



(721) 721 
(697) 
(505) 
{707) 
(656) 

(721) IKCNYVFNNSLSRQLQPINYFDSYLGCVVHADNSTSEAVQ 
-Section 20 




human coronavirus OC43 S 
avtanintediousbronchifisvlrusS 
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



(761) 761 
(737) ~ 

(545) KiSPYVSYG 
(747) 
(696) 

(761) TCDIiTVGSGYCVDYSK RRSRRSITTGYRFTNFEPFTVNS 
Section 21 




human coronavirus OC43 S 
avianinfectiousbrondi'rtisvirijsS 
bovine coronavirus S 
mouse hepatifis virus S 
Consensus 



(801)801 
(777) 

(571) ElFVftPLFHVTBN! 
(787) 
(736) 

(801) VNDSLBPVGGLYEIQIPSEFTIGNMEBFIQTSSPKVTIDC 
Section 22 




human coronavirus OC43 S 
avianinfecSousbronchriisvinisS 
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



(841)841 
(817) ~ 
(611) 
(827) 
(776) 

(841) AAFVCGDYAACKSQLVEYGS FCDNINAILTBVNELLDTTQ 
— — Section 23 




human coronavirus OC43 S 
avfanlnfectiousbronchltisvirusS 
bovine coronavirus S 
mouse hepatitis virus $ 
Consensus 



(881)gH 
(857) ~ 
(651) 
(867) 

(816) gjMa^SlliP^ 
(881) LQVASSI,MNGVTLSTKIiKDGVNFNVDDINFSPVLGCLGSS 

Section 24 

330 g40 g50_ 960 




human coronavirus OC43 S 
avianinfectiousbronchitisvirusS 
bovine coronavirus S 
mouse hepatitis virus S 
Consensus 



(921)9*1 
(897) " 



(686) 'SBNug "L^^piTSllBSVGl 

(907) mwM iW#p?iSl 

(856) ^EDGNGPSAIRG|^^gj; 
(921) CAK SS RSAIEDLLFDKVKLSDVGFVEAYNNCT 




WO 2004/092360 



55/193 



PCT/US2004/011710 



FIGURE 4F (contd.) 
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FIGURE 4F (contd.) 
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FIGURE 6 

FIGURE 6A 




WO 2004/092360 PCT/US2004/011710 

59/193 

FIGURE 6B 
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FIGURE 6C 
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FIGURE 7 

FIGURE 7A 

SEQ ID NO: 6053 MFVLLVAYALLH 12 

SEQ ID NO: 6057 MKKLFmVVMPLIYGENFPCSKL^ 60 

SEQ ID NO: 6061 MRSLIYFWLLLPVLPTI^ VQAPAVWLGG 52 

SEQ ID NO: 6065 MFLILLISLPTAFAVIGDLKCTWS-INDVDTGVPSIS 38 

SEQ ID NO: 6069 MLFVFILPLPSQiGYIGDFRCIQLVNSNGANVS APSIS 40 

SEQ ID HO: 6042 HFIFLLFLTLTSGSDLDRCTTFDDVQAPNY^ 39 

SEQ ID NO: 6072 MLVTPLLLVTLIX]ALCSAVLYDSSS 27 



SEQ ID NO: 6053 

SEQ ID NO: 6057 YFPTVQPWFNCIENDSNDLYVTLENLK^ ITVT 120 

SEQ ID NO: 6061 YLPSMSSSWYCGTGIETASGVHGIFLSYIDSG^PEIGISQEP FDPSGYQLYLH 107 

SEQ 3D NO: 6065 TDTVDVTNGIXjTYYVLDRVYLNT^ YY 69 

SEQ 3D NO: 6069 TETVWSQGLGTYYVLDRVYLNATLLLTC YY 71 

SEQ ID 190 1 6042 VYYPDEIFRSDTLYLTQDLFLPFYSNVTGFHTJJJHTJra — -—HP 79 

SEQ ID NO: 6072 YVYYYQSAFRPPSGWHLQG 46 



SEQ ID NO: 6053 IAGCQTTN GLNTSYSVCNG 31 

SEQ 3D NO: 6057 TTRNFNSAEGMICICKGSPPTTTTESSLTC 180 

SEQ ID NO: 6061 KATNGNTNAIARLRICQFPDNKTLGPTVNDV^ 167 

SEQ ID NO: 6065 PTSGSTYRNMAMGTLLLSTLWFKPPFLSDFTNG^ 129 

SEQ 3D NO: 6069 PVDGSKFRNLALTGTNSVSLSWFQPFYl^ 131 

SEQ ID HO: 6042 VIPFKDGI YFAATEKSNVVRGWVFGSthwt^QSVI I INNSTNWI RACNFELCDNPFFA 139 

SEQ ID NO: 6072 GAYAVVNISSEFNMGSSSGCTVGIIHGGRVVNASSIAMTAP — 88 



SEQ ID NO: 6053 

SEQ ID NO: 6057 LQWFADEVVAYLHGASYRISFENQWSGTVTFGIMIACT 240 

SEQ ID NO: 6061 NDRVTVFADKIYHFYLKNOT^ 214 

SEQ ID NO: 6065 STFVNTSYSVWQPHTTILGNKLQGFLEISVCQ 171 

SEQ ID NO: 6069 SLFGYTSYTWIEPYN GVIMASVCQYTICLLFYT 165 

SEQ ID NO: 6042 VS KPMGTQTHTMIFDNAFN 158 

SEQ ID NO: 6072 



SEQ ID NO: 6053 CVCTSEIWFAVESa^IPSDFAFNNWFLL^ 74 

SEQ ID NO: 6057 KVNNKNGTTWSNCTDQCASYVANVFTTQR 300 

SEQ ID NO: 6061 WSAGEDGIYYEPCTANCTOYAANVFATDSNGHIPK 274 

SEQ ID NO: 6065 ICNPN-LGNQRVELWHWDTGWSCLYKRNFTO 230 

SEQ ID NO: 6069 DCKPNTNGNKLIGFWHTDV^ 225 

SEQ ID SO: 6042 - CTFBYISDAFSLDVSIKSGNPKHLREFV jrwmiXSFLYVYKGYQPIDVVRDLPSGFNTLK 217 

SEQ ID NO: 6072 SSGMAWSSSQPCTAHCNFSDTTVFVTHCYKHGG — CPLTGMLQQN 131 



SEQ ID NO: 6053 QPLLLNCLWSVSGLRFTTGFVYFNGTGRGD^ T 131 

SEQ ID NO: 6057 QPLLWCLWPVPSFEEAASTFCFEGAGFDQ 360 

SEQ ID NO: 6061 QPIJiWCLLAIPKIYGI^FFSF 332 

SEQ ID NO: 6065 TKFLFNVYI£TVLSHYYVMPLTCN SALTLEYWVTPLTSKQYLLAFNQDGVIFNAVD 286 

SEQ ID NO: 6069 TTFLFSVYIGDILTQYYVLPFICNPTAGSTFAPRYWVTPLV^ 285 

SEQ ID NO: 6042 PIFKLPLGINITNFRAIMAFSPAQDIWGTS 277 
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SEQ ID NO: 6072 LIRVSMKNGQLFYNLTVSVAKYPTF^ 191 



SEQ ID NO: 6053 ILFKTSYG VVVFYCTNNT-LVSGDAHIPFGTVL^^ 186 

SEQ ID NO: 6057 WSLNTTGGVTLEISCYTVSDS-SW NGTALKYLGTL 416 

SEQ ID NO: 6061 IVLHTALG TmSFVCSNSSDPHLMFMPLGATEVOTCFI^^ 389 

SEQ ID NO: 6065 CKSDPMSEIKCKTLS IAPSTGVYEI^GYTVQPIADVYRRI PNLPDCN- IEAWLNDKSVPS 345 

SEQ ID NO: 6069 CASSYTSEIKCKTQSMLPSTGVYELSGYWQP^ 344 

SEQ ID NO r 6042 CS^LAELKCSVKSFEIDKGIYQTSBFRVV^ 336 

SEQ ID NO: 6072 FKAGGPITYKVMREVKAMYFVNGTAQD 251 



SEQ ID NO: 6053 PKTVREFVISRTGHFYINGYRYFTLGNVE^ TAETTDFCTVALASYADVLV 242 

SEQ ID NO: 6057 PPSVKEIAISKWGHFYINGYNFFSTFPIDCISFNLT TGDSDVFWTIAYTSYTEALV 472 

SEQ ID NO: 6061 PPTVREIVITKYGDVYVNGFGYLH^ 449 

SEQ ID NO: 6065 PLNWERKTFSNCNFNMSSLMSFIQAYSFTC^ KEYGMCFSSITIDKFAIENG 401 

SEQ ID NO: 6069 PLNWERKTFQNCNFNLSSLLRYV^^ KVYGRCFGSISVDKFAVPRS 400 

SEQ ID BO: 6042 VYAWERKKI SNCVADYSVLYNSTFFSTFKCYGVSAT KLNDLCFSNVYADSFWKGD 392 

SEQ ID NO: 6072 LVKQKFIVYRENSVNTTCTI^ 309 



SEQ ID NO: 6053 NVSQTSIANinCNSVINRLRC^ 300 

SEQ ID NO: 6057 QVENTAITKVTYCNSHVNNKCSQITANLNNGFYPVSS — SEVGLVNKSVVLLPSFYTET 530 

SEQ ID NO: 6061 EVQGTSIQRILYCTDWSQIiKCSQVAPDLDIWFYPISSra 509 

SEQ ID NO: 6065 RKVDI^LGNLGYLQSFNYRIDTTATSCQL^ 459 

SEQ ID NO: 6069 RQVDLQI/3NSGFLQTANYKIOTA — INNHNPSSWNRRYGFNDA6 458 

SEQ ID NO: 6042 DVRQIAPGQTGV3JU)YNYK^ STGNYNYKYRYLRHG 446 

SEQ ID NO: 6072 SFLSSFVYKESNFMYGSYHPSCKFRLETIMGLWFNSLS VSIAYGPLQGGCKQS 363. 



SEQ ID NO: 6053 
SEQ ID NO: 6057 
SEQ ID NO: 6061 
SEQ ID NO: 6065 
SEQ ID NO: 6069 
SEQ ID SO: 6042 
SEQ ID NO: 6072 



FIVLYVDFKPQ-SGGGKCFNCYPAGVNITLANFNET KGPLCVDTSHFTTKYYAVYAN 356 

IVNITIGIjGMKRSGYGQPIASTLS — NITLPMQDHN TDVYCIRSDQFS-VYVHSTCK 584 

FVNITVS AAFGGLSSANLVASDTTINGFSS FCVDTRQFTITLFYNVTN 557 

VFKPQPAGVFTDHDVVYAQHCFKASTNFCPOTiM 519 

VFGKN QHDWYAQQCFTVRSSYCPC 483 

KLRPFER 453 

VFKGRAT 370 



SEQ ID NO: 6053 VGRWS ASINTGNCPFSFGKVNNFVKFGSVC^ 402 

SEQ ID NO: 6057 SALWDNIFKM(^VLDATAVI^ 644 

SEQ ID NO: 6061 SYGYVS KSQDSNCPFTLQSVNDYLSFSKFCVSTSLLAGACTIDLFG 603 

SEQ ID NO: 6065 TNYLTCHNMQCDCLCTPDPITSKATO G 575 

SEQ ID NO: 6069 AQPDIVSPCTT — QTKPKSAFVNVGDHCEGLGVLEDNCGNADPH 525 

SEQ ID NO: 6042 DISNVPFSPIX3KPCTPPALNCYWPIiND 480 

SEQ ID NO: 6072 CCYAYSYGGPSLCKGVYSGELDHNFECGL 399 



SEQ ID NO: 6053 NWAYSKY YTIG SLWSWSDGDGITGVPQPVEGVSSFMNVTLDKC 446 

SEQ ID NO: 6057 -RTRTNE QWR SLYVIYEEGDNIVGVPSDNSGVHDLSVLHLDSC 687 

SEQ ID NO: 6061 YPAFGSG VKLT SLYFQFTKGELITGTPKPLEGITDVSFMTLDVC 647 

SEQ ID NO: 6065 NPCTCQP QAFLGWSVDSCLQGDRCNIFANFILHDVNSGTTC 632 

SEQ ID NO: 6069 KGCICAN NSFIGWSHDTCLVNDRCQIFANIHiNGINSGTTCSTO 582 

SEQ ID NO: 6042 YGFYTTTGICT QPYRVVVLSFELLNAPATVCGPKLSTDLIOT 524 

SEQ ID NO: 6072 LVYVTKS GGSRIQTATEPPVITQNNYNNITLNTC 433 
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SEQ 


ID 


NO: 6061 


SEQ 


ID 


NO: 6065 


SEQ 


ID 


NO: 6069 


SEQ 


ID 


NO:6042 


SEQ 


ID 


NO: 6072 



TKYNIYDVSGVGVIRVSNDTFLN GITYTSTSGNLLGFKDVTKGTIYSITPC 497 

TDYNIYGRTGVGI IRQTNRTIiLS GLYYTSLSGDLLGFKNVSDGVIYSVTPC 738 

TKYTIYGFKGEGIITLTNSSILA GVYYTSDSGQLLAFKNVTSGAVYSVTPC 698 

VNYDLYGITGQGIFVEVNATYYNS WQNLLYDSNGNLYGFIU)YLTNRTFMIRSC 685 

VKYDLYGITGQGVFKEVKADYYNS WQTLLYDVNGNI^FMLTTNKTYTIRSC 635 

VHPHEBGLTGTGVLTPSSKRFQP PQQFGRDVSDPTDSVRDPKESKILDISPC 576 

VDYNIYGRTGQGFITNVTDSAV^ 493 
>>••••••• •• ••• • • 

NPPDQLWYQQA — VVGAMLSENFTSYGPSNVVELPKFFYASNGTYN 542 

DVSAQAAVIDGT — IVGMTSINSELLGLTHWTTTPNFYYYSIYNYT^ 796 

SPSEQMYVNDD--IVGVISSI^S--TPNNTmro 741 

YSGKVSAAFHAN- -SSEPALLFm 743 
YSGRVSAAFHKD--APEPALLYRN^ 693 
AFGGVSVITPGflSIASSEVAVLYQDV^^ 636 
KDVNQQFWSGGK-LVGILTSRNETGSQLLENQFYIKITO 544 



-CTDAVLTYSSFGVCADGS IIAVQPRNVSYDSVSAIVTAN LSI 584 

VDCEPVITYSNIGVCKNGA FVFINVTHSDGD-VQPISTGN VTI 838 

-CTEPVLVYSNIGVCKSGS IGYVPSQYGQVK-IAPTVTGN ISI 782 

I^QTCDLTVGSGYCVDYS 803 

EALPNO)I^GAGLCVDYSKSW 753 

IGAEHVDTSYICTIPIGAGIC^^ 696 

VANCPYVSYGKFCIKPDGS IATIVPKQLEQFVAPLFNVTEN VLI 588 



PSNVOTSVQVEYLQITSTPIVVIE^ 644 

PTNFTISVQVEYIQVYTTPVSIDCSRY^ 898 

PTOTSMSIRTEYLQLYNTPVSVKATYVCNGNSRCKQIj^ 842 

PSEPTIGMEEFIQTSSPKVTIIX^SAWCEDYMOT 863 

PTNFTIGHHEEPIQTRSPKVTIIX^AFVCGIM 813 

PWSISITTEVMFVSM^SV^^ 756 

PNSFNLTVTDmQTRlffiKVQ 648 



SADVSEMLTFDKKAFTLANVSSPGDYN LSSVIPSLPTSGS 684 

NMEVDSMLFVSENALKLASVEAFNSSOT 954 

SVEVNSMLTISKEALQLATISSPNGDG YNFTNVLGASVYDPASGR 887 

DTTQI^VMSI^GVTLSTKLKIX5V1^ VDDINFSPVLGCLGSDCN 909 

DNMQLQVASAIiMQGVTISSRLPDGISGP IDDINFSPLLGCIGSTCAEDGN 863 

DRRTREVFAQVKQMYKTPTIiKYFGGFN FSQILPDPLKPTK 796 

DMELLNFYSSTKPAGFNTPVLSNVSTG EFNISLLLTNPSSRRK 691 



— RVAGRSAIEDILPSKLVTSGLGTVDADYKKCTKGLS — IADLACAQYYNGIMVLPGV 739 

KRKYRSMEDLLFDKWTC^ 1009 

WQKRSVIEDLLFNKVVTNGIiGTVDEDYKRCSNGRS — VADLVCAQYYSGVMVLPGV 942 

KVSSRSMEDLLPSKVKLSDVG-FVEAYNNCTGGAE — IRDLICVQSYNGIKVLPPL 963 

GPSAIRGRSAIEDIIiFDIOT^^ 920 

RSFIEDLLFNKVTLADAG-FMQYGECLGDIN — ARDLICAQKFNGLTVLPPL 846 

RSLIEDLLFTSVBSVGLP-TNDAYKNCT 743 

** ****** . * * ***..*. *** . 
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SBQ ID NO: 6053 ADAERMAMYTGSLIGGIALGGLTS AVSIPPSIAIQAMiNYVALQTDVLQENQKILA 795 

SBQ ID NO: 6057 ANADIO«GMYTASLA.GGITLGALGGG AVAI PFAVAVQARLNYVALQTDVLNKNQQIM 1066 

SEQ ID NO: 6061 VDAEKLHMYSASLIGQ4ALGGITA AAALPFSYAVQARIOTLALQTDVLQRNQQI^ 998 

SEQ ID NO: 6065 LSENQISGYTLAATSASLFPPWSA MGVPFYMVQYRINGI6VTMDVLSQNQKLIA 1019 

SEQ ID NO: 6069 LSESQISGYTTGATAAAMFPPWSA MGOTPSLSVQYRINGIX3VTMNVLSENQKMIA 976 

SEQ ID 110:6042 LTDBfflLAAYTAALVSGTATA^ 906 

SEQ ID NO: 6072 ITAMQALYTSSLVASMAFGGITA AGMPPATQLQARINHIjGITQSLLLKNQKKIA 799 

• •»•• • • ••••••••• 

SEQ ID NO: 6053 ASFNKAMTNIVDAPTGVNDMTQTSQALQTVAT 855 

SEQ ID NO: 6057 SAFNQMGNITQSFGKVNDAIHQTSRGIA^ 1126 

SEQ ID NO: 6061 ESFNSMGNITSAFESVKEAISQTSKGIOTVAHAiyrK\^ 1058 

SEQ ID NO: 6065 NAFNNALGAIQBGPDATN SALVKIQAVVNMAEALNNLLQQLSNRF 1065 

SEQ ID NO: 60 69 SAFNNALGAIQDGFDATN- SALGKIQSVVNANAEAI^LMQLSNRP 1022 

SEQ ID NO: 6042 NQFNKAISQIQESLTTTS TALGKLQDWNQNAQALNTLVKQLSSNP 952 

SEQ ID NO: 6072 ASFNKAIGHMQEGFRSTS LALQQIQDWSKQSAILTETMASLNKNF 845 

**.*: : ..: ** ::***. :. *. .* .* 

SEQ ID NO: 6053 .QMSSSIQAIYDRLDTIQADQQVD^ 915 

SEQ ID NO: 6057 QMSSSISDHNRIiDELSADAQVDRLITGRM^ 1186 

SEQ 3D NO: 6061 QMSSSIDDIYSRLDILSADVQVDRLITGRLSALN^ 1118 

SEQ ID NO: 6065 GMSSSLQEILSRLDALEAQAQIDRLINGRLTAI^ 1125 

SEQ ID NO: 6069 GMSASLQEILTRLEAVEAKAQIDRLINGRLTALNAYISKQI^DS^ 1082 

SEQ ID NO: 6042 GMSSVLHDILSRLDKVEAEVQIDIM 1012 

SEQ ID NO: 6072 GAISSVIQEIYQQFDMQANAQVDRLITGRLSSLSVLASAKQAEYIRV^ 905 

. * ... . * .*.★*** *** .* * *. 

a • a ••»•«»• • •»••• • ■ 

SEQ ID NO: 6053 ECVKSQSKRYGFCG-NGTHIPSIVNAAPEGLVFLHTVL^ 970 

• SEQ ID NO: 6057 ECVRSQSQRFGFCG-NGTHLFSL^ 1244 

SEQ ID NO: 6061 ECVKSQSQRYGF(XX5DGEHIFSLVQAAPQGI^^ 1174 

SEQ ID NO: 6065 ECVKSQSSRINFCX^GNHIISLVQN 1180 

SEQ ID NO: 6069 EC^SQmiNFCG-NGNHII^LVQNAPYGLYFIOT 1137 

SEQ ID SO: 6042 ECVLGQSKKVDFOS-KGraLMSFPQAM 1067 

SEQ ID NO: 6072 ECVKSQSIRYSFCG-NGRHVLTIPQNAP^ 964 
*** *. * *** * * ** *. *.* * •* 

a * ■ a • • • • a • • ••• 

SEQ ID NO: 6053 TNGYVLRQPNLALYK EGNYYRITSRIMFEPRIPTMADFVQIENCNVTFVNISRS 1024 

SEQ ID NO: 6057 TFGLWKDVQLTLFRN LDDKFYLTPRTMYQPRVATSSDFVQIEGCDVLFVNATVS 1299 

SEQ ID NO: 6061 EIALTLREPGLVLPTHELQTYTATEYFVSSRRMPEPR 1234 

SEQ ID NO: 6065 DRGIAPKSGYFVNVNN TWMPTGSGYYYPEPITGNNVVVMSTCAVNYTKAPW 1232 

SEQ ID NO: 6069 DRGLAPKAGYFVQDDG EWKFTGSSYYYPEPITDKNSVI34SSCAVNYTKAPEV 1189 

SEQ ID NO: 6042 -KAYFPREGVFVFNGT SWFITQRKFFSPQIITTDNTFVSGNCDVVIGIINNT 1118 

SEQ ID NO: 6072 QYAIVPANGRGIFIQVN GSYYITARDMYMPRAITAGDVVTLTSCQANYVSVNKT 1018 

a. »♦ ♦ . ^ ^ , 

SEQ ID NO: 6053 BLQTIW-EYIDVNIOTLQELSYiai- 1082 
SEQ ID NO: 6057 DLPSIIP-DYIDINQTVQDILENFRPNWTVPELTPDIFNATO 1358 
SEQ ID NO: 6061 QLPD\TCP~DYIDVNKTLDEILAS 1292 



SEQ ID NO: 6065 MLNISTP-NLPYFKEELDQWFKNQTSVAPDLSIiDY — INVTPLDLQDEMN 1279 

SEQ ID NO: 6069 PIOTSIP-NPPDFKEELDKWFKNQTSIAPDL 1238 

SEQ ID NO: 6042 VYDPLQP-ELDSFKEELDKYFKNHTSPDTO 1166 

SEQ ID NO: 6072 VITTFVDNDDPDFNDEI^KVWDTKHELPDFDKFN — YTVPILDIDSEID 1066 
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SEQ ID NO: 6053 YTVQKLQTLIDNINSTLV^ 1142 

SEQ ID NO: 6057 NTTVELAILIDNINNTLVNL^^ 1418 

SBQ ID NO: 6061 NTOEBLRSLINNINNTLVDLEWI^ 1352 

SEQ ID NO: 6065 RLQEAIKVLNQSYINLKDIG^^ 1335 

SEQ ID NO: 6069 ItfQDAIKKLNBSYINLKEV^ 1294 

SEQ ID NO 1 6042 MJEVAKNLHESLIDLQEL 1222 

SBQ ID NO: 6072 MQGVIQGLNDSLIDIiE^ 1122 



SEQ ID NO: 6053 
SBQ ID NO:6057 
SBQ ID NO: 6061 
SBQ ID NO: 6065 
SBQ ID NO: 6069 
SBQ ID NO: 6042 
SEQ ID NO: 6072 



CGFPSCPA SSIRGCCESTKLPYYD-VEKIHIQ 1173 

CGCIGCLG SCCHSICSRRQFENYBPIEKVHVH 1450 

CGCCGCC6 ACFSGCCRGPRLQFYEAFEKVHVQ 1384 

G — TSCFK KCGGCCDDYTGHQELVIKT SHED- 1364 

6 — SCCFK kcgno:deygghqdsivihnisshed- 1326 

CSCLKGAC SCG- SCCKFDEDDSEFVLKGVKLHYT- 1255 

CmXXX^TMPT^SKCGKKSSYYTTFIM^ 1164 



FIGURE 7B 



SEQ ID NO : 6054 MFLKLVDDHA- LVVNVLLViCVVLIVILLVC IT I IKLIKLCFTCHMPCNRTVY 51 

SEQ ID NO: 6062 MLQLVNDNG-LVVNVILWLPVLFPLLIISITF^ 54 

..SEQ ID NO: 6058. MTFPRALTVIDDbTC-M^nNIIFWFLLIII 59 

SEQ ID NO: 6045 OTSFVSEETGTIJVNS^^ 52 

SEQ ID NO: 6073 MNLLNKSI^ENG-SFLTAm 57 

SEQ m NO: 6066 MFMADAYFADTVWYVGQIIFIVMCLLVIIVVVAPM 54 



SEQ ID NO: 6054 GP IKNVYH- IY-QSYMH IDPP PKRVIDF 77 

SEQ ID NO: 6062 TP IGRLYR-VY-KSYMR IDPL PSTVH3V- 80 

SEQ ID NO: 6058 VP AQHAYD-AY-KNFMR IKAYN PDGALLA - 86 

SEQ ID NO: 6045 KP TVYVZS-KV-KHUJS SEGV PDLLV 76 

SEQ ID NO: 6073 IPGAKGTAFVYKYTYGRIO^NNPEL^ 112 

SEQ ID NO: 6066 SP SIYVFN-RG-RQFYBF YNDVKP FVLDVDDV 84 



FIGURE 7C 



SEQ ID NO: 6055 — MSNDNC TGDIVTHLKNWNF 19 

SEQ ID NO: 6063 MSNGSIP VDEVIEHLRNWNF 24 

— MILLIMCVIACA(^BRYCAMKSIM)^ 60 



SEQ ID NO: 6059 

SEQ ID NO: 6067 MSSVT-TPAFVYTWT- 

MSSTTQAPEPVYQWT- 

HADNGTIT 



SEQ ID NO: 6070 

SEQ ID NO: 6046 

SEQ ID NO: 6074 MPNETNCTLD— 



-ADEAIKFLKEWNF 27 
-ADEAVQFLKEWNF 33 
-VEELKQLIiEQWNL 21 
-FEQSVQLFKEYNL 28 



SEQ ID NO: 6055 
SEQ ID NO: 6063 
SEQ ID NO: 6059 
SEQ ID NO: 6067 
SEQ ID NO: 6070 
SEQ ID NO: 6046 



GWNVILTIFIVILQFGHYKYSRItf^^ 78 

TWNI HjTILLVVLQYGHYKYSVFLYGV^ 83 

SWSIILIVFITVLQYGRPQFSWmGIKMLIMWLLWPVVL^ 120 

SLGIILLFITVILQFGYTSRSM^^ 84 

SLGIILLFITIILQFGYTSRSMFDTVVKm 90 

VIGFLFIiAWIMLIiQFAYSNRI^ 78 
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SEQ ID NO: 6074 



FITAFLLFLTIILQYGY^^ 85 



SEQ ID NO: 6055 FSFFMAVSTLVMWVMW^ 137 

SEQ ID NO: 6063 FSII^CITIMiWIMYFVNSIRLWRRTHS^ 142 

SEQ ID NO: 6059 FSIACyUOTFVLWIMYFVR^ 179 

SEQ ID NO: 6067 FSIVFTIVMlMIVyFVNSIRLFIRTGSMS 143 

SEQ ID NO: 6070 FSIVFTIVSIVIWIMYFVNSniLFIRTGSWW 149 

SEQ ID BO: 6046 IAIAHACIVGIiHWLSYFVASFRI^ 137 

SEQ ID NO: 6074 AMILTVFACLSFVGYWIQSIRLFKRCRSMSFN^ 145 



SEQ ID NO: 6055 GITVTLLSGVLYVDGHRLASGVQVHNLPEYM^ 195 

SEQ ID NO: 6063 GmTLLSGTLLVEGYKVATGV^ 200 

SEQ ID NO: 6059 GVTLTLLSGNLYAEGFKIA^ — WA 237 

SEQ ID NO: 6067 TLTVTIIRGHLYMQGIKLGTGYSLSDLPAYVTVAKVSffi — FA 200 

SEQ ID NO: 6070 TLTATI IRGHLYMQGVKU3TGF — FA 206 

SEQ ID NO: 6046 VIGAVIIRjGHLRMGHSLGR^^ 193 

SEQ ID NO: 6074 VLSPIIKNt^YCEGQWLAK-CEP 204 



SEQ ID NO: 6055. FYVRVKHGDFSAVSSPMSNMTENERLLHFF 225 

SEQ ID NO: 6063 FYVRSKHGDYSAVSNPSAVLTDSEKVLHLV 230 

SEQ ID NO: 6059 YYVKSKAGDYSTEAR-TIMiSEQEKLLHMV 266 

SEQ ID NO: 6067 VYVKSKVGNYRLPSTQKGSGLDTALLRNNI 230 

SEQ ID NO: 6070 VYVKSKVGNYRLPSN-KPSGADTALLR — I 233 

SEQ ID NO: 6046 AYNRYRIGNYKLNTDHAGSNDNIALLVQ — 221 

SEQ ID NO: 6074 TFVYAKQSVDTGELESVATGGSSLYT 230 



FIGURE 7D 



SEQ ID NO: 6056 MATVKWADASE PQRGRQG 18 

SEQ ID NO: 6064 MASVSF QDRGRK 17 

SEQ ID NO: 6060 MANQGQRVSWGDEST KTRGRSNSRGRKN 31 

SEQ ID NO: 6068 MSFTPGKQSSS-RASSGNRSGNGILK WADQSDQSRNVQTRGRR-AQPKQTATSQQPS 55 

SEQ ID NO: 6071 MSFVPGQMAGGRSSSVNRAGNGILKKTTWADQTERG 58 

SEQ ID NO: 6051 MSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGRNGARPKQRRPQGLPN 48 

SEQ ID NO: 6075 MASGKAAGKTDAPAPVIKLGGPKP PKVGSS 35 



SEQ ID NO: 6056 RIPYSLYSPLLVDSEQPW-KYIPRNLWINKK-DKNKLIGYWN — VQKRFRTRKGK 70 

SEQ ID NO: 6064 RVPLSLYAPLRVTNDKPLSKVLANNAVPTNKG-NK^ — EQIRWRMRRGE 70 

SEQ ID NO: 6060 — NNIPLSFFNPITLQQGSKFWNLCPRDFVPKGIG-NRDQQIGYM — RQTRYRMVKGQ 85 

SEQ ID NO: 6068 GGNVVPYYSWFSGITQFQKGKEFEFAEGQGVPIAPGOTATEMGYWYRH^ 115 

SEQ ID NO: 6071 SGSWPHYSWFSGITQFQKGKEFQFAEGQGVPIMG 118 

SEQ ID NO: 6051 NT ASWFTALTQHGK-EELRFPRGQGVTINTNSGPDDQIGYYRRATRR- 101 

SEQ ID NO: 6075 GNASWFQMKAKKLNTPPPKFEGSGVPDNENIKPSQQHGYWR — RQARFKPGKGG 88 

* : : - ** .**:::* 

SEQ ID NO: 6056 RVDLSPKLHFYYLGTGPHKDAKFRER 130 

SEQ ID NO: 6064 RIEQPSNWHFYYLGTGPHGDLRYRTRTEGVFV^^ 130 

SEQ ID NO: 6060 RKELPERWFFYYI^TGPHADAKFKDKLIXJVVWVAKDG 145 

SEQ ID NO: 6068 QRQLLPRWYFYYLGTGPHAKDQYGTDIDG — EAIP 173 
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SEQ ID NO: 6071 QKQIiPRWYFYYLGTGPHAGAS^ — EAIP 176 

SEQ ID BO: 6051 HKELSPHWYFYYLGTGPEASLP^^ 159 

SEQ ID NO: 6075 RKPVPDAWmYTGTGPAADI^WGDTQre 146 

*** **** *** * 

• «•••• •• 

SEQ ID NO: 6056 QKLPNGVTWEE PDSRAPSRSQSR SQSRGRGESK 164 

SEQ ID NO: 6064 QQLPSWEIVEPNTPPASRANSISRSRGNGNNR^ 190 

SEQ ID NO: 6060 6KVPGEFQLEVN QSRDNSRSRSQ SRSRSRNR 176 

SEQ ID NO: 6068 TRFPPGTVLPQGYYIEGS-GRSAPNSR STSRASSRASS 210 

SEQ ID NO: 6071 TRFAPGTVLPQGFYVEGS-GRSAPASR SGSRSQSRGP 212 

SEQ ID 130:6051 LQLPQGTTLPKGFYAEGSRGGSQASSR SSSRSRCTSR 196 

SEQ ID NO: 6075 LRFSDGGPDGNFRWDFIPLNRGRSGRS TAASSAAASR 183 



SEQ ID NO: 6056 PQSRNPSSDRNHN SQDDIMKAVAAALKSLGFDKPQEKDKKS 205 

SEQ ID NO: 6064 ASQNRGGNNNNNNKSR^^ 249 

SEQ ID NO: 6060 SQSRGRQQFNNKK DDSVEQAVLAALKKLGVDmQQQRS- 215 

SEQ ID NO: 6068 AGSRSRANSGNRT PTSGVTPDMADQIASLVLAKLGKDAAKP 251 

SEQ ID NO: 6071 NNRARSSSNQRQ PASTVKPDMAEEIAALVLAKLGKDAGQP 252 

SEQ ID NO: 6051 NSTPGSSRGNS PARMASGGGETALALLLLDRLNQLESKV 235 

SEQ ID NO: 6075 APSREGSRGRR SDSGDDLIARAAKI IQDQ 212 



SEQ ID NO: 6056 AKTGTPKPSRNQSPASSQTSAKSLARSQSSETKEQKHM^ 265 

SEQ ID NO: 6064 QQKPKQEKSDNSGKNTPK KNKSRATSKERDLKDIPEWRRIPKG — ENSVAACF 300 

SEQ ID NO: 6060 RSKSKERSNSKTRDTTPK NENKHTWKRTAGK GDVTRFY 253 

SEQ ID NO: 6068 QQVTKQTAKEIRQKILN KPRQKRSPNK— QCTVQQCF 286 

SEQ ID NO: 6071 KQVTKQSAKEVRQKILN KPRQKRTPNK — QCPVQQCF 287 

SEQ ID NO: 6051 SGKGQQQQGQTVTKKSAAEASK KPRQKRTATK — QYNVTQAF 275 

SEQ ID NO: 6075 QKKGSRITKAKADEMAH RRYCKRTIPP — NYRVDQVF 247 

.* * 



SEQ ID NO: 6056 GPRDLDH NFGSAGVVMGVKAKGYPQFAELVPSTAAMLPDSHIVSKESG 314 

SEQ ID NO: 6064 GPRGGFK NFGDAEFVEKGVDASGYAQIASLAPNYAALLFGGNVAVREIA : 349 

SEQ ID NO: 6060 GARSSSA NFGDTDLYANGSSAKHyPQLAECVPSVSSILTOSYWTSKEDG 302 

SEQ ID NO: 6068 GKRGPNQ NFGGGEMLKLGTSDPQFPILAELAPTAGA 343 

SEQ ID NO: 6071 GKRGPNQ NFGGSEMLKLCTSDPQFPILAELAPTTC - SGGADE 342 

SEQ ID NO: 6051 G^GPEQTQGNFGDQDL331QGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTP 327 

SEQ ID NO: 6075 GPRTKGK-EGNFGDDKMNEEGIKDGRWAMLNLVPSSHACLFGSRW 298 

* * *** * * . . 



SEQ ID NO: 6056 NTWLTFTTRVTVPKDHP HLGKFLEELNAFTR EMQ 349 

SEQ ID NO: 6064 DSYEITYNYKMTVPKSDP NVELLVSQVDAFKTGNM-LQRKKEKKNHIETTLQ 401 

SEQ ID NO: 6060 DQIEVTFTHKYHLPKDDP KTGQFLQQINAYAR PSEVAKEQR 343 

SEQ ID NO: 6068 PQKDVYTiMYNGMRFDSTLSGFETI GTMNMSPKPQRQ — R 397 

SEQ ID NO: 6071 PTKDVYELQYSGAVRFDSTLPGFETIMVLNE^ GADWSPKPQRKGRR 398 

SEQ ID NO: 6051 SGTWLTYHGAIKIiDDOTPQFKDNVIIiIJnCHIMYKTCT PTE 368 

SEQ ID NO: 6075 DGLHLRFEFTTVVPCDDPQFDNYVKIC^ 355 



SEQ ID NO: 6056 QHPLLNPSALEFNPSQTSP ATAEPVRDEVSIETDIIDEVN 389 

SEQ ID NO: 6064 * QHEEMYDDVGAPSDVTHANLEWDTAVIX5GDTAVEI INEIFDTGN 446 

SEQ ID NO:6060 KRKSRSKSAERSEQDVVPDALIENYTDVFDDTQVEIIDEVTN 385 

SEQ ID NO: 6068 GQKNGQGENDNISVAAPKSRVQQNKIRELTAEDISIiKKMDEP FTEDTSEI 448 

SEQ ID NO: 6071 QAQEKKDEVDNVSYAKPKSSVQRNVSRELTPEDRSLLAQII^ 454 
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SEQ ID BO: 6051 -PKKDKKKKTDEAQPLPQRQKKQPTVTLLPAA 399 

SBQ ID NO: 6075 NSPAPRQQRPKKEKKLKKQDDEADKALTSDKKRNNAQI^FYDEPKVINWGDAALGENBL 414 



FIGURE 7E 



FIGURE 7F 
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FIGURE 7G 
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FIGURE 7H 



22SE 



PEOV 



TOR2PEP13 
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FIGURE 8 



Section 1 

(1)1 10 22 ,30 42 

avian IBV partial 5'UTR 161- (1) TATTAAAATCTTATTGTTGCTGGTATCACTGCTTGTTTTGCC 

HCoV-OC43 5 , UTR (1) 

bovine CV 5'UTR (1) 

Consensus (1) 

Section 2 

(43) 43 gO ,60 ,70 84 

avian IBV partial 5UTR 161- (43) GTGTCTCACTTTATACATCTGTTGCTTGGGCTACCTAGTGTC 

HCoV-OC43 5'UTR (1) 

bovine CV 5UTR (1) 

Consensus (43) 

Section 3 

(85) 85 £0 ,100 J10 126 

avian IBV partial 5'UTR 161- (85) C AGC G TC C T ACG GGCGT C G T GGCTG GT T C G AG T G CGAG G AAC 

HCOV-OC43 5'UTR (1) 

bovine CV 5'UTR (1) 

Consensus (85) 



(127) 

avian IBV partial 5'UTR 161- (127) 
HCoV-OC43 5 l UTR (1) 
bovine CV 5'UTR (1) 
Consensus (127) 



127 



,140 



CTCTGGTT 




150 




(169) 

avian IBV partial 5'UTR 161- (169) 
HCoV-OC43 5'UTR (33) 
bovine CV 5'UTR (33) 
Consensus (169) 



169 



(211) 

avian IBV partial 5'UTR 161- (209) 
HCoV-OC43 5'UTR (68) 
bovine CV 5'UTR (68) 
Consensus (211) 



gtSggc? 

gattg gagcgatttgcgtgcgtgca tcccgc 

- • ~ Section 5 

180 ,190 gOO 210 

teT^aAAifl 



G AC G T ACgG^T[Sp^$G^^p5G T^AA^gA — .Q© # G G G$ t§AC 

PPw» 

!s$g&wl^^ 

TTCA CTGATCTCTTGTTAGATCTTTTCGTAATCTA 

Section 6 

211 220 230 £40 252 





ctPccccc^I 

aactttataaaaacatccactccctgtagtctatgcctgt^ 
. Section 7 



(253) 

avian IBV partial 5'UTR 161- (251) 
HCoV-OC43 5'UTR (110) 
bovine CV 5'UTR (110) 
Consensus (253) 



253 



280 



294 



|£jTAc|jTS^ 

G C GT AG AT T T T T CAT AG T G G T G T C T AT AT T CATTTCT GCT 

— Section 8 

,310 £20 336 



(295) 295 g00 



avian IBV partial 5'UTR 161- (293) ^G^GTGCp^ 

HCoV-OC43 5'UTR (150) (GTT^Ci^^ 

bovine CV 5'UTR (150) 

Consensus (295) G T T AAC AG C T T TCAGC C AG G G AC GT GT T GT AT CC T AGGC 
— Section 9 



(337) 337 

SEQ ID NO: 
SEQIDNO: 

SEQ ID NO: 9892 :(189) 
consensus (337) 



.350 



,360 



373 




1^^-^£ca^ 

agtg' gcccacccataggtcacaatg 
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FIGURE 9 



F1: AT^G^G ^A-^ G^GTG 
TGC A CG TTT C 



SEQ ID NO: 

(136-154 nt) 6021 



F2: GTGkaTG^AT^C^CTTCA 

C C CC G 



(152-172 nt) 



6022 



F3: CTTCAC^G-^TCT%TGT^-GA 

T A C TA 



(168-195nt) 



6023 



R1: AGAA^ICAC-gTC^GGjTG (307-329nt) 
G TACAA G CCT C * 7 



R2: AAAf G^f TAT A^ cf^C grTATG (265-288nt) 
R3: cfjc ^TATG C-^GCCCA (250-274nt) 



6024 
6025 
6026 
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FIGURE 10 



avian IBV 3'UTR (NC_001451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



(D 
d) 
(D 
d) 
(D 



1 



,10 



Section 1 
36 



GTAACATAATGGACCTGTTGTTTCCTGGTACATTTT 



avian IBV 3'UTR (NCJ0O1451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



(37) 
(37) 
(D 
(D 
(37) 



37 



.50 



Section 2 
72 



GTTAAACACTATTTCTGTGCTTTCCTATCAATTATT 



73 



,80 



avian IBV 3'UTR (NC_001451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



.90 



-Section 3 
108 



ACAGGCATTGATTGTGATTATGTTCAATACTTAAGC 



avian IBV 3'UTR (NCJ)01451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



avian IBV 3'UTR (NCJD01451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



avian IBV 3'UTR (NC 001451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



avian IBV 3'UTR (NC_001451) 27103- 
HCoV-OC43 3'UTR partial 
bovine CV 3'UTR 
Consensus 



(73) 
(73) 

(1) 

(1) 

(73) 

Section 4 

(109) 109 t 120 ,130 _144 

(109) ttcttttggttgctttttgcttatVgtattgttgc 
(1) 

{1) 

(109) 

— Section 5 

(145) 145 m J50 160 v 170 _ 180 

(145) G'TGCTTTTTATTGTTG^ 

(1) 

(1) L^j 

(145) A GAGAATGAAC 

Section 6 

(181) 181 190 20 0 216 

(ISli'CTTAT G T CGG C AC C T 66 T G G T AACC CC T C GCAGG 

- - - — Section 7 

(217) 217_ 230 £40 _252 

(21 5) ^^G0AT^£GcixG^T^ CTj^A^pg^W^GC 

(48) m^^^G'iM j^^Glefc!^ 

(45) W^?£fim m^Bm& ^^e^^A G 

(217) AAAGTCGGG ATAAGGCAC TCTCTATCAG 
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FIGURE 10 (contd.) 



(253) 

avian IBV 3'UTR (NC 001451) 27103- (251) 
HCoV-OC43 3WR partial (76) 
bovine CV 3'UTR (73) 
Consensus (253) 



270 



(289) 

avian IBV 3'UTR (NC 001451) 27103- (287) 
HCOV-OC43 3'UTR partial (107) 
bovine CV 3'UTR (104) 
Consensus (289) 



Section 8 

, „ ,„ . . _ _ 288 

CK^ ,©MAA{r6J^f!AAT C T ^HcS^cMt AG j^^C C T GiGS 

AATGGA TG T CTTGCT G C T ATAATAG AT AG A G 

—Section 9 

289 £00 310 324 



gpACGASCG^fe^ 
A AG G T TATA G C AGAC TAT AG ATT 





avian IBV 3'UTR (NC 001451) 27103- (323) 
HCoV-OC43 3'UTR partial (139) 
bovine CV 3'UTR (136) 
Consensus (325) 



(325) 325 330 



.340 



AATTAGTTG 

Section 10 

,350 360 




(361) 

avian IBV 3'UTR (NC_001451) 27103- (358) 
HCoV-OC43 3'UTR partial (175) 
bovine CV 3'UTR (172) 
Consensus (361) 




(397) 

avian IBV 3'UTR (NC 001451) 27103- (393) 
HCoV-OC43 3'UTR partial (206) 
bovine CV 3'UTR (203) 
Consensus (397) 



(433) 

avian IBV 3'UTR (NC_001451) 27103- (429) 
HCoV-OC43 3'UTR partial (238) 
bovine CV 3'UTR (235) 
Consensus (433) 



AAAGTTTTGTGTGGTAATGTATAGTGTTGGAGAAAG 

— Section 11 

361 370 380 396 



TGCCAG^^Cp^G^C^^^^^^OT^C 

TG AAAGACT TGCGGAAGT^^GCCGACAAG 

— Section 12 

397 410 420 432 

G^^^^g^^^^^^ccA^pGj; 

TGCC C AAGGGG AAG AGC C AGC AC G TTAAGTTA 

Section 13 

433 j440 t 450 . 468 





(469) 

avian IBV 3UTR (NC.001451) 27103- (462) 
HCoV-OC43 3'UTR partial (274) 
bovine CV 3'UTR (271 ) 



CCA CCAGTAATTAGTAAATGAATGAAGTTAATTAT 

- — Section 14 

469 -480 ,490 504 




(505) 505 513 
avian IBV 3'UTR (NCJ)01451) 27103- (498) GTTAGAGCA 

HCoV-OC43 3'UTR partial (293) 

bovine CV 3'UTR (290) 

Consensus (505) 



Section 15 



SEQ ID NO: 9911 
SEQ ID NO: 9920 
SEQ ID NO: 9893 
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F-1 TCTATC-f^f A f GGATGTCT 

AGA T 



(245 - 265 nt) 



SEQ ID NO: 
6027 



F-2 TTAGTT J AA^TTT f GT J t| GT (318 - 339 nt) 



6028 



F-3 TAGTGTT ^ GAG ^ A | GT | TAA AG A (346 - 368 nt) 



r-1 a^tt^gccata^tJaactt 



(458 - 476 nt) 



6029 
6030 



R-2 actaa^t^g ^T^C Jtaa 

AATT A T CT CC C 



R-3 T ^TC^GC^T-^C^GCA 
AC C G GG CC G 



(426 ~448nt) 



(375-395nt) 



6031 

6032 
6033 



FIGURE 12 



Coils output for unknown 



window** 14 ' 
window=81 - 



£88 488 688 888 1888 1888 1488 
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SEQ ID NO: 6569 




1 sgfrkmafps gkvegcmvqv 
61 ksnhsf lvqa gnvqlrvigh 
121 spsgvyqcam rpnhtl 
181 fygpfvdrqt aqaagtsdtti 
241 pltqdhvdil gplsaqtgia 
301 sgvtfq 




Catalytic His 



ttlngl xjlddtvycpr [hpictaedml npnyedllir 
llrlk vdtsnptetpk jffcfvriqpgq tfsvlacyng 

Iptg vhagtdlegk 
tttlndf nlvamkynye 
Igscilede f tpfdwrqc 




Key residues of the substrate site 
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Section 1 
51 



avian 18V nsp2 (1) SGFKfcgvspgSAVEKClVSV^ KFg 

MHVnsp2 (1) ~" * ~ 

SARS nsp2 (1) 

BCoVnsp2 (1) 
Consensus 



SGFRKHAF 



SKl'EPCIVSVTyGNKTLtlGLVrLDDKVyCPRpiVie 
|GKVBQ^VQVTCGT^Ll)IQLWLl>D§VYCPP.y9ICi 



KFgG 

DKTD 

vicsa&dhtn 



(1) SGIVKMVSPSSKVBPCIVSVTYGinOTLHGLWLDDTVYCPRHVICSAADMtlir 
Section 2 



(52) 

avian lBVnsp2 (49) 
MHVnsp2 (52) 
SARSnsp2 (52) 
BCoVnsp2 (52) 
Consensus {52) 
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(103) 

avian 1BVn3p2 (100) 
MHVnsp2{1Q2) 
SARS nsp2 (102) 
BC6Vnsp2<102) 
Corts&nsus (103) 



pi>CTgLLCItV^^PC VHP "1$ MfVMS YOMOQCQX-yiUtm^QH PN? PKY 
j£fr&SDXiL I HK^raS PLVO^H- VQlJR'/^^MQncgLR L KVDTS N PffTFKY 
SP^S^liuCRV^^OFTVSPDft -gS LfTVaS Y 'QMtjtfCgLy ^LTV^ Jlft>ISgT P K Y 
PDY HLI-CRVTSSDP VLSGR ySLTVMSYQMQGCLLVLTVTLQWPKTPKY 

1 ' — SectonS 

103 .11 0 ^ _ .120_ .130 ,140 _ 153 

&£IKANC -~ JW ^ "'" " Jr " ~ " 



pgsv 
sv 



<154) 

avian fBVnspS (151) 
MHVnsp2(153) 
SARS nsp2 (153) 
BCoVnsp2(153) 
Consensus (154) 



i§FVRg^G.Q&F^ . 
TFG.toiq^^^ 
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— . — : — : 1 Section 4 

204 




154 



J60 



190 



gKGV V»F F^i^fcE LPNlLH*5t E#f G^: T GG W'D EEVaQRV P P DNLV$NN 

§Yi^VSFCaw^^ 
MGI^tff^ 

DGDCVKFVYMHQLBLSTGCHTG^Dti GDFYGPYVDAQVVQLPVQDYTQTVM 

■ ; Sections 

2 05 210 220 23a 240 25 5 



(205) 

avian 18V nsp2(202) 
MHVnsp2(204) 
SARS nsp2 (204) 
BCoVnsp2(204) 

Consensus (205) VVAWLYAAIIW CK 



WIiYAJQ? X3 VKB SSFS LP) 

VTOWtFA&ipwt^^ r "W 

^ ,WLYSJigi§GbR ' 






KW^GDiKGf^FSiS - -? 

£*#JAikYI^ 
DFtrWA&GFSQlKiD- -£ 



(259) 

avian IBVnsp2 (251) 
WHVnsp2(247) 
SARS nsp2 (249) 
BCoVnsp2(247) 
Consensus (256) 



25ft 




270 



WFLQSOTCSliEDFKVWAMSNGFSPXKSD L 

— : : i Sections 

280 280 _ 306 



SgVdVcki^^ 

VIDALAAMTGVSVE IjLAAIKRIi s gfqgrqilgsciledeltpsdvyqq 
: SecBon 7 



(307) 

avian IBVnsp2 (301) 
MHVrtSf>2(297) 
SARS nsp2 (300) 
BCoVnsp2(297) 
Consensus (307) 



307 31JL 

JiAGVKLQ 
cfgjGVTFQ 
LAGgKIiQ 
XiAGVKXiQ 



SEQ ID HO: 6570 

SEQ ID HO: 6571 

SEQ ID HO: 6569 

SEQ U> MO: 6572 

SEQ H> HO: 6573 
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FIGURE 18 
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FIGURE 19 
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FIGURE 20 
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FIGURE 21 
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FIGURE 23 

LPRKSQPTSISCRSVL-TNFKICVAVARL^ 
I^RPCCSRSSAYIiGFVRV-PKGKMESLVLGVNEKTHVQLS 

ALS EAREHLiKNGTCGLVEIjEKGVIjPQLEQP YVF I KRSDAL STNHGHKVVEIjVAEMDG I QY 
GRSG I TLGVLVPHVGETP I AYRNVLLRECNGNKGAGGH S YG IDLKS YDLGDELGTDP I EDY 
EQNWNTKHGSGALRELTI^IjNGGAVTRYVDNNF 
QLDYIESKRGVYCC31DHEHEIAWF 

NS KVKVI QPRVEKKKTEGFMGRI RSVYPVAS PQECNNMHL STLMKCNHCDEVSWQTCDF L 
KATCEHCGTElSrLVIEGPTTCGYLPTNAVVKMPCPACQDP^ 

RKGGRTRCFGGCVF AYVGC YNKRAYWVPRAS AD I G SGHTG I TGDNVETLNEDLLiE I L SRE 
RVNINIVGDFKLNEEVAI II^SFSASTSAFIOTIKSLDYKSFKTIVESCGNYKVTKGKPV 
KGAWNIGQQRSVLTPLCGFPSQAAGVT 

LRLVDAMVYTSDLIiTNSVI IMAYVTGGLVQQTSQWLSNIiIiGTTVEKLRPIFEWIEAKLSA 
GVEFLKDAWEILKFLITGVFDIVK^ 
AKLRSLNIjGEVFIAQSKGLYRQCIRGKEQLQLIjMPL^ 
LKNGELEALETPVDSFTNGAIVGTF^ 

GAP IKGVTFGEDTVWEVQG YKNVRI TFELDERVDKVLNEKC SVYTVE SGTEVTEFACVVA 

EAVVKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAE 

CEEEEIDETCEHEYGTEDDYQGLPLEFGASAETVRVEEEEEEDWLDDTTEQSEIEPEPEP 

TPEEPVNQFTGYDKLTDNVAIKCVDIVKEAQSANPMVIVNAA 

NGAMQKESDDYIKIiNGPLTVGGSCLLSGH*^^ 

SQDILIiAPLLSAGIFGAKPLQSLQVCVQTVRTQVYIAVNDKALYEQV^ 

APKQEEPPNTEDSKTEEKSVVQKFVDVKPKIKACIDE 

IOiYHDSQNMLRGED^ 

VDEY I TTYPGQGC AGYTLEEAKTALiKKCKS AFYVLP SEAPNAKEE I LGTVSWNLREMIiAH 
AEETRKLMPICimVRAIMATIQRKYKGIKIQE 

NEPLVTMPIGYWHGFNLEEAARCMRSL^ S SKTSEEHF 

VETVSLAGSYRDWSYSGQRTEIjGVEFIjKRGDKIVYHTLESPVEFHLIXj 

SLREVKTIKVFTTVDISrimjHTQLVDM 

LPSDOTLRSEAFEYYHTLDESFLGRYMSALNHTKKWKFP 

LALQQLEVKFNAPALQEAYYRARAGDAANFC AL I LAYSNXTVGELGDVRETMTHLIiQHAN 
LESAKRVLNWCKHCGQKTTTLTGVEAVMYMGTLS YDNL I PCVCGRDATQYLVQQ 

E S SFVMMS AP PAEOTaiiQQGTFrjC ANEYTGIST^ QCGHYTH I TAKETLYRIDGAHLTKMS EYK 
GPVTDVFYKETSYTTTIKPVSYKLIX5VTYTEIEPKL 
PNASFDNFKLTCSNTIOFA^ 

KGAKLLHKPIVWHINQATTKTTFKPNTWCIiRCLWSTKPVOT 

CESQQPTSEEWENPTIQKEVIECDVKTTEWGNVILKPSDEGVKVTQEL^ 

ENTSITIIOCPNELSIiAIiGLKTIA^ 

AQRVFNNYMPYWTLLFQLCTFTKSTNSRIRAS 

PKFSKLFTIAMWLDL^ 

FCEGSFPCSICLSGLDSI^SYPALETIQVTI^^ 

l^LSAIMQWFGYFASHFISNSWL*^ 

MDGCTSSTCMMCYKRNRATRVECTTIVNGMK^ 

GSTFISDEVARDLSLQFKRPINPTDQSSYITVD^ 

FVNLDlSn^RAN^^ 

GDSTEVSVKMFDAYVDTFSATFSVPMEKLKALV^ 

VVDTDViyrKDVIECLKLSH^ IDCNARHIN 
AQVAKSHNVSL I WNVKDYMSLSEQLRKQ IRS AAKKNNI PFRLTCATTRQVVNVT TTKI SL 
KGGKIVSTCFKLMIjKATLLCVIjAALVCYIVMPVHTIjS^ 

I STDDCFANKHAGFDAWFSQRGGSYKNDKSCPVVAAI itreigfivpglpgtvlraingd 

FIjHFLPRWSAVGNICYTPSKLIEYSDFATSACVIiAAECTIFKDAM 

GSISYSELRPDTRYVLMDGSIIQFP^^ 

sgrwvlnnehyralsgvfcgvdamnlianiftplv 

ayyfmkfrrvfgeynhvvaanallflmsfti 

sflahlqwfamfspivpfwitaiywcisli^^ 



WO 2004/092360 



84/193 



PCT/US2004/011710 



CTFLLNKEMYIjKLRSETLLPLTQY 

FSNSGADVLYQPPQTSITSAVLQSGFRKMAFPSGKVEGCMVQVTC 

CPRHVICTAEDMLNPNYED^ 

TPKYKFVRIQPGQTFSVLACmGSPSGVYQ 

S FCYMHHMEL PTGVHAGTDIJ3GKFYG PFVDRQT AQ AAG TDTT I TLNVLAWL YAAVI NGDR 
WFLNRFTTTIiNDFNLVAMKYNY^ 

GRTILGSTIIjEDEFTPFDVVRQCSGrVTFQGKFKKIVKGTHHWMLLTF 
SLFFFVYENAFLPFTLGIMAIAACAMLLVKHEHAFLCLFIiL 

MRIMTWLiEIjAiyrSLSGYRIjKIXJVl^AS ALVLL I IiMTARTVYDDAARRVWTLMNVT TLVYK 
VYYGNALDQAISMWALVISVTSOT^ 

LWCFLGYCCCCYFGLFCLLNRYFRLTLGVYDYIiVSTQEFRY 
NIKLLGIGGKPCIKVAWQSKMSDVKCT 
AKiyraEAFEKMVSIiLSVIiliSMQGAVDINRL 
QEAYEQAVANGDSEVVLKKLKKSIiNVAKSEFDI^ 

KE^AKVTSAMQTMLFTMLRKLDNDAIjNNI INNARDGCVPLNI I PLTTAAKLMVWPDYGTY 
KNTCDGNTFTYASALWE I QQWDADSKI VQL SE INMDNS PNLAWPL IVTALRANS AVKLQ 
NNEL S FVALRQMSCAAGTTQTACTDDNALAYYN^ PKSD 
GTGTIYTELEPPCRFVTDTPKGPKVK^ 
PANSTVLSFCAFAVDPAKAYKDYIiASGGQPITNCVK^ 
FGGASCCLYCRCHIDHPOTKGFCDLKGKYVQIPTT^^ 
CSCDQLREPLMQSADASTFLNGFAV-VQF^HliHRAAQALVIiMSSTGLL 
-KLIAVASRRRMRKAIY-TLTL-LRGI 

TWYHI YHVSV-LNTQWL I - SMLYVILMRVI VIH-KKYS SHTI AVMMI I SIRRIGMTS -RI 
LTSYAYMLT-VSVYANHY-RLYNSAMLCVMQAIi-AY-H- 1 IRILMGTGTI SVI SYK-HQA 
AEFLLWIHITHC-CPSSL-IjGHWLj^ 

IGTRHTIPIVLTVWMIGVSFIVQTLMCYFLLCFHLQVLDH — EKYL-MVFLLLFQLDTIF 
VS-ESYIIRM-TYIARVSVSRNF-CMDLIQL^^ 

LSNPVILIKTFMTLLCLETVSLRKEVLI^-NTSSIjLRMATIjLSVIMTI ivi icqqcvisdn 

SYS-LKLLINTLIVTMVAVLMPTK-SLTIWIN^ 

RILSVMSSLL-LK-ILS1^LVQRIEI^ 

LEQASFTVAGIIC-IOJFTVM-KLHTL^ 

HTVSTG-LTSVRKY-VRWSCVAAHYMI^ 

QliMVIR-LTSMSAIYNTGSMSVSIEIGMLIMNSWMSFTLiTCVNISP — FFLMMPLC AI TV 
TMRLKV — IiALRTLRQFF I IKIMCSCLRQNVGLRIjTLLKDLTNFAHS IQC -LNKEMITCT 
CLTQIHQEY-AQAVLSMILSKQMVHL-IiKGSCHWLLMLTH 

ES YMMSDIiATCWTC I P -C-LMITPHGTGNLSFMRLCTHHIQSCRL-VLVYCAIHRIiHFVA 
VPVDGDHS YVASAAMTMSFQHHTN-CCIiLI PMFAMPQWMSIiM-HNCI-EV-AIIASHIS 
LPIiVFHYVLMVRFLVYTKTHV- AVTMSLTSMR- QHVTGI1MI1AI T YLPTLVIiRDS SFSQQK 
RSKPLRKHLSCHMVTiPLYAKYSLTENCIFHGRLENLDHH-TETM 
STPLKKVTMVMLLCTEVIjRHTS-MLVI^ 

TSQMS FLAMLQI I KRS ACKSTLHSKDHLVLVRVILPSDLLS ITHLLA- C IRHAIiMQIiLMP 
YVKRH-NICP- INWESYLRVRA-SVIilNSK- IQH-NSMFSAL-MHCQKQLLTL- SLMKS 
LWLLIMT-VLSMLDFVQNTT^ 
SLELVAVVLLKLLTL-VL-FMTIS-KOT 

ENFLHAI LliGEKLFLSHLiI IHRTL-LQKS -DCIjRRLL IHHRVLNMTMS YSHKLLKQHTLV 

MSTASMWLSQGQKIxAFCA-CIjIEIFMTNCNLQV-KYHV 

LWILHRHLHTSALI -SSRLKDYVLTYQAYQRT-PTVDSSL- 

AKKLFVTFVRGLALM-RAVMQLEMLWVLTYIjS S -DFLQVIiT — LYRLVMLTLKI TQNSPE 
^QNLHQOTSLNILYHSCIKACPGM-C^^ 

STLSRLDLKERVVCVTNV QLAFLLHQ I LMPAG 1 1 LWVLTMS ITHL-LMFS SGALRVTFRV 
TMTNIARYMEMHM^ 

L-SLHCLLISFQFFMTLEIQRIjSSVCIjRIjK-NGSSTMLSHVVTKLTK 
NSLMVFVCFGIVTLIWQPMQLCVGL^ 

LLI - SNCLSFTILr^VSLMANK-CRJLIMFHSNLLRVIiHDAI -WLFADTMQMSTDSTW 
MHII — FLLDLAYGFTNNLILITC^ 

MLFTQR-MVLMWRSLKIRQHFIjIjMLHLSFGLSVTIjNQCQRLRYS I IWVLI SLLIIi- SGTT 
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KEKPQHMYLQ -VSAQ - LTL PRNLLRVLVLHLL SCLMVEWKDR - TFLET PVMVF — QKVQS 
KV-HLQRDQHKLASMESH-LENQ-KH^ 

WKLTFSSSLVMNSYSDISSRAMPSNTSFMEISVMDNIjAVFI A-PSAHKIHHLN-RIL 

SLWTAQ-KITS-QMRKQVHQNVCVIj-LIFYIjMTrjSR — SHKICQ-FQKWSRLQLTMLKFH 

SCFGVRMDMLKPSTQNYKQVKRGNQVLRCLTC 

MSQSILNCVNT-IHLL-LYPTT-ELPTLVLALIKEL^ 

S S PTH I LL - LETVQQ Y I RL INGTLLLAI CMTLGPNM- QKRMTLKKGF S L I CVDL - SKN- P 

WVVTi-L-R-QSIIX?MLTFTSL 

PC^TTFSGGTQILSSCLPIHSLT-ANFIJ^ 

EKTTELWFQVIFLLTTKRTCLFSYYFL^ 

I LMKFLDQTLF I - LRI YFFHF I IiMLQGF I LL I IRLATL S YIiLRMVF I LLPQRNQML SWG 
FLVIjP-TTSHSR-T>T»T,TILLMLLra 

STYLMPFRLMFQKSQVILNTYESLCLKIKMGFSMFIRAIOT 
CTjIjVLTLQILEPFLQPFHLLKT^ 
HLLNSNAIjLRALRIjTK^ 
EKKFLrVLLITI^STTQHFFQPL^ 

LVLLLI 1 1 INCQMI SWWSIil^IJ^TLMIiLQLVI 1 1 INIGIIiDMASIiGPLRETYLMCLSP 

LMANLAPHIjljLjIVIGH-MIMVFTPLIiAIiA 

TSVSIIjILMDSLVIjVC-IjLIjQRDFNHFNNLAVMFLISLI 

V- LHLiEQMLHLKLLFY I KMLTALMFIjQQFMQ INS HQLGAY I LLETMY SRliKQ AVL - EL SM 
STLIJ^ATFLIiEIAFVIi^ 

lallqk-clflwlkpp- rrecTS 

KCSLKSNKCTKPQL-NILWIilFHK^ 
A-VILMLEISFVRRSSMDLQCCHL^ 

CKWHIGSMALELPKMFSMRTKNKSPT^ * 

THLLNNLALILVQFQV^ 

LLLIIjLLIiKCLSWLDNQKEL^ 

QFVMKAKHTSLVKVFLCLMAIiLtGIjLHRGTSFLHK-LLQTIHL 

LCNLSLTHSKKSWTSTSKI IHHQMLILATFQALTIiLSSTFKEQKLTASMRSLKI -MNHSLT 
• FKNWENMSNimGLGMFGSASLLD-^ 

TLSQFSRVSNYITHKRTYGFVYEIFYSWINYCTASKN-QCFSCKYCSCYSMDTATSLT 

RMACYWRCISCCFSERYQNNCAQ-KMAASPL-GLPVHLQFTAAICYHLFTSFACRCRYGG 

AIFVTLCLDIFSTMHQRM-1^^ 

HRYNCRY- R-RHFNTKTQRRLPNWWLF - G - ALRC - RLCRCTWLFHRSLLP A- VYTNYYRH 
WY-KC YILHL- QAC-RPTECANTHNRRLFRSC - S SNGSNL — ADDDY-RAFVSTRK-VRT 
YVLIRFGRNRYVNS — RTSFSCFRGIIxASHTSHPYCASIVCSTLLQYC-REFSKTNGLRIjL 
AC-KSELF-RS S - SSGLNELTI 1 1 ILFGTLTLLIMADNGTITVEELKQIiLEQWNLVIGFL 
FliAWIMLLQFAYSNRNRFLYI I KLVFLWLLWPVTLACFVIjAAVYRINWVTGGIAIAMAC I 
VGLMWLSYFVASFRIiFARTRSMW^ 

HLRMAGHSLGRCD I KDLPKE ITVATSRTLS YYKLGAS QRVGTDSGFAAYNRYRI GNYKLN 

TDHAGSNDNIALLVQ-VTTDVSSC-LPGYNSRDIDYHYE^ 

- ASN- EELFGVR — RTYGVRLSIKRT-KLFSS-H-LYLHLAS^ 

AHQEHTRAIHHFTLLLTINLH-LALAHTL^ 

KSSTRHFFSLLLL-YF-YFASPLRERQNE-AHFN-LLFVLFSLSAIPCFNNAYYILVFTR 
NPGSRRTDYQSIiNEHETSHCFDLYFSMQLHMHCSTALCI — TSCA-RSL-GTTLGVILIA 
IiLGFVL-ERFYLFIDGTLWFIOiAHIj^^ 

F-INEQIKMSDNGPQSNQRS7VPRITFGGPTDSTDNNQNGGRNGARPKQRRP 

WFTAIjTQHGKEELRFPRGQGVPINTN^ 

YLGTGPEASLPYGAlSTKEGrVWVATEGALNTPKDHI 

YAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGETALAL^ 

SKVSGKGQQQQGQTVTKKSAAEASKKPRQKRT^^ 

IRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTP 

LLOTQIIDAYKTFPPTEPKKDKKKKT^ 

SGASADSTQA-TLMMTTQ 

L-SMCTIREDLKEPPHFHRGHAEYDRGYSE-C-GELPIWKSPW-N-F-CYPHVILIAS 
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FIGURE 24 

TQEKPTNLDLL- 1 C SLNEL -NIX? SCRS AACLVHLRS INNNKF YCR~ QETSNS SLFCRLLT 

VSSVLQSIISIPRFRPGVTER-DGEPCSWCQRENTR^ 

PIGGT-TPQKWHLWSSRAGKRRTAPA-TAIX:^ 

S-RYNTGSTRATCGRNPNCIPQCSSS-ER—^^ 

TKLEH-AWQWCTP-THS-AQWRCSHSLC^^ 

T-LHRVEERCLLLP-P-A-NCLVH-AIi — ELRAPDTLRN-ECQEI-HFQRGMPKVCVSS- 

LKSQSHSTTC-KEKD-GFHGAYTIjC^ 

SHL-TLWH-KFSY-RTYYMWVPTY-CCSEN^ 

QGR-D-MFWRIiCVCLCWLL — ACLIiGSSC-C-YWLRPYWHYW-QCGDLE-GSP~DTES-T 

C-H-HCWRFSFE-RGCHHFGIFLCFYKCL^ 

RCLEHWTTEI SFNTTVWFSLTGCWCYQINFCAOT 

TSCRRHGLYFRPAHQQCHYYGICNWWSCTTD^ 

S-ISQGCLGDSQISHYRCF-HRQGSOT^^ 

KVAITQIjR-SIjHRSKQGTIjPSVYTWQGAAATTHAS-GTKR^ 

QER-TRSTRDAR- -LHKWSYRRHTSLCECWPHAIiRD-GQR 

CTN-RCNLWRRYCLGSSKLQECENHI -A — TC-QSA — KVLCLHC- IRYRS Y-VCMCCSR 
GCCEDFTTSF- SPYQHGY- S — VECSYILLI — CW-RKLFITYVLFLLPSR-GRRGRCRV 
-GRRN — NL-T-VRYRG~LSRSPSGIWCIiS-NSSS~GRRRGRIiAG-YY-AIRD-ARTRTY 
T-RTS - SVYWLFKTY-QCCH-MC -HR-GGTKC- S YGDCKCC-HTPETWWWCSRCTQQGNQ 
WCHAKGE--LH-AKWPSYSRRVLFAFWT^ 

TGHLTCTr^SRHIWC-TTSVFTSVRADGSYTGLYCSQ-QSSL-AGCHGLS — PEA-SGS 
T-TRGATKHRRFQN-GEICRTEACRCEAKN-GLH — GYHNTGRN- VS YQ- VTLVC - YQW- 
ALP-FSEHA-R-RYVFP-EGCTLHGR-CYH-W-YHLCC^ 
— VYNHVP*mTOCWLYT^ 

-RDKKINAYMHGC-SHNGNHPT-V-RN-NSRGHR-LWCPILLLY — RACSFYYYEAELSK 
- AACHNANWLCDTWF- S -RGCALYAFS - SSCRSVSI ITRCC YYI -WI PHFVIKDI -GALC 
RNSFFGWLLQRLiVLFRTAYRVRC -IS - AW-QNCVPHSGEPRRVS S -R-GSFT- QTKESL I 
PAGG-DYKSVHNCGQH-SPHTACGYVYDIWTA^ 

T HTT — SFRVLPYS — EFSW-VHVCFKPHKEMEISSSWWFNFN-MG — QLLFV-CFI 

STSTA-SQIQCTSTSRGLL-SPCW-CC-LLCTHTRLQ — NCWRAW-CQRNYDPSSTAC-F 
GICKASS-CGV~TLWSENYYLNGCRSCDVYGYSIL — S-DRCFHSMCVWS-CTXTISSTTR 
WFCYDVCTTC-V-ITARYILMC^ 

TSD-CFLQGNILHYNHQACW-TRWSYIiHRD-TKIGWVL-KG-CLLYR 

KCEF — FQTHMF-HKIC — FKSNDRLHKASFTRAICHILPRIiEWRCSGY-Ii-TIiFSEFQE 

RC - ITA- ANCLAH- PGYNQDWQTKHLVFTLSLEYKASRYFKFI - S SGSRRHTRNGQSCL 

-KSTTHIi-RSSGKSYHTEGSHRV-RENYRSCRQCHT-TIR-RC-SNTRVRS~GS 

KHKHYH-ET — AFTSLRFKNNCHSWYCCN — CSLE-NFGLCQTILRTSSNYNIKLR~EIS 

TTCV-QLYALC^irVPIVYFY-K^ 

QIF-IVHNRYVAIVVKYLLRFSNLCN^ 

L-RFFSLQHLFKWIRLP-FLSSS-NHSGDDFIVQARLDNFRSGR-VGFGIYVVHKILLF^ 
RSFSYNAGVIjWIjFC-SFHQQFIjAHVVYH-YC 

GWLHLFDLHDVL-AQSCHTR-VYTTYC-WHEEIFLCLCKWRPWLLQ 

-YIH SCS-FVTPV-KTNQPY-PVIVYC — CCCEKWRASPLL-QGWSKDL-ETSALPF 

CQFRQFES-QH^RFTAY-CHSF-WQVQMRRVCF-VCFCVLQSADVPTYSVA-PSSCIRRW 
R-Y-SFR-DV-CLCRHLFSNF-CSYGKT-GTCCYSSQRVSKGCSFRWCPFYIRVSCPTRC 
C - YRC - HKGC Y -MS QTFTS L - LRSDR- QL - QFHAHL — G-KHDAQRSWRMY-L-CKAYQC 
PSSIOCSQCFTHLECKRLHVFI-TAA-TNS-C^ 
GW-DC-YLF-TYA-GHIIVRSCCIGLLYRYASTYI^ 

FY — LFCK-TCWF-RW-PAWWFIQK-QKLPCSSCYHYKRDWFHSAWLTGYCAESNQW 
IiAFSTSCF-CCWQHLLHTFQTH-V — FC YLCLRS CC - VYNF - GC YGQTCAI KL - H - FARG 
FYFL — ASSRHSL^AYGWFHHWS-HLPGG 

W-MGS ALQSSIRSFLWC-CDESHS-HLYSSC^TCGCFRCVCFSSGWWYYCHIGDIiCC 

IiLLYEIQTCFW-VQPCCCC-CTFVFD^ 

ILGSPSMVCHVFSYCAFLDNSNIiCILYFSEALPLVL-QLS -EKSHV-WSYI -YLRGGCFV 
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YLFAQQGIWPKIA-RDT^ 
-QLRC-CSLPTTTDrNHFCCSAEWF- 
SKTCHLHSRRHA-S-L-RSAHSQIQP-LSCSGWQ^ 
TQV-ICPYPTWSNIFSSSMLQWFT 

FLLYASYGASKHRSTRWY-LRR-ILWSIC-QTO^ — V 
VS — IHHYFE-L-PCGNEVQL-TFDTRSC-HIGTSFCSNRN^ 
SYYPW-HYFRR-VYTI-CC-TMLWCT^PR^ 
TVFLCIiRECFIAIYSWYYGN^ 

AYHDMA-IG-H-LVWL-A-GLCYVCFSFSFAYSHDSSHCL — CC-TCLDTDECHYTCLQS 

LLW-CFRSSYFHVGLSYFCOTJ-LFWCRYDYHVFS-SYSVC^ 

CLLFLRLLLLLLLW^ 

H-VVGYWR-TMYQGCT^CTV-NV-RKVHICGTAIX5SSTT~SRVIF-IV 
KRHN-SFREDGFSFVCFAIHAGCCRH — WRGNAR- PCY S SGY CFRI -FFTI ICRLCHCP 
GGL- AGCS-W-F- SRSQKVKE IFECG- 1 -V- P- CCHATQVGKDGRSGYDPWQTGKI -GQ 

EGKSN-CYANNALHYA-EA CT-QHYQQCA-WLCSTQHHTIDYSSQTHGCCP-LWYIjQ 

EHL-W-HLYICICTLGNPASC-CG-QDCST — N-HGQFTKFGLASYCY S SKSQLSC -TTE 
— TESSSTTTDVLCGWYHTNSLY — QCTCLL-QFEGR-VCAGITIRPPRSQMG-IP-E-W 
YRYNIiHRTGTTL-VCYRHTKRA-SEILV^ 

CQFNCAFLLCFCSRPC-SI-GLPSKWRTlTniQIjCEDVVYTHWYRT^ 

WWCFMLSVL-MPH-PSKS-RIL-LER-VRPlJITYHIiC — PSGFYT-KHSLYRIiRNVERLWL 
. - L- PTPRTLDAVCGC INVFKRVCGVSAARLTPCGTGTSTDVVYRAFDI YNEKVAGFAKFL ■ 
KTNCCRFQEKDEEGNLLDSYFVVKR^ 

MVPH I SRQRLrTKYTMADLVYALRHFDEGNCDTLKE I LVTYNCCDDDYFNKEODWYDFVENP 
DIIiRVYAJSTLGERWQSIiLKWQFOTAMRDAGIVGVLTLDNQ 

GVP IVD S YYSLLMP I LTLTRALAAE S HMDADLAKPL I KWDLLKYDFTEERLC IjFDRYFKY. 
-WDQTYHPNCINCIiDDRCILHCAlSlFNVLFSTVFPPTSFGPLV^ 
ELGVVHNQDVNLHSSRIjSFKEIiLVYAADPAMHAAS^ 

VKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNL 
LFWEWDKYFDCYDGGC INANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFA 
YTKRNVI PTITQMNLKYAI SAKNRARTVAGVSICSTMTNRQFHQKLIjKS IAATRGATWI 
GTSKFYGGWHNMLKTVYSDVETPHLMGWDYPK^ 
. HRFYRLANECAQVLSEMVMCGGSLYV^^ 

TIX3NKIADKYVRNLQHRLYECLYRNRDVDHEFVDEFYAYI/RKHFSMM 

YAAQGLVAS I KNFKAVIj YYQNNVFMS EAKCWTETDLTKGPHEFC S QHTMLVKQGDDYVYL 

PYPDPSRILGAGCFVDDIVKTIX3TIiMIERFVS 

KLHDELTGHMLDMYSVMLTNDNTSRYW^ 

ACIRRPFIiCCKCCYDHVISTSHKLVLSVNPWCNAPGCI^ 

PISFPLCANGQWGLYKNTCVGSDNVTDFNAIATCDWT^ 

LKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNR^ 

YTFEKGDYGDAVVYRGTTTYKLNVGDYFVLTSH 

ISDEFSSWAlSryQKVGMQKYSTLQGPPGTGKSHFAIGIxALYYPS^^ 

CEKALKYLPIDKCSRI I PARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEI S 
MATNYDLSVVNARIiRAKHYVYIGDPAQL 

LGTCRRC PAE IVI>IVS ALVYDNKLKAHKDKS AQCFKMF YKGVITHDVS S AINRPQ IGWR 
EFLTRNPAWRKAWISPYNSQNAVASKI^ 

VNRFNVAITRAKIGILC IMSDRDLYDKLQFTSLEI PRRNVATLQAENVTGLFKDCSKI IT 

GLHPTQAPTHLSVDIKFKTEG^TTOIPG 

EEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLGFSTGV 

NAKPPPGDQFKHLIPIiMYKGLPWNVVRIKIVQMLSiyrLKGLSDRVVFVLWAH 

YFVKIGPERTCCLCDKRATCFSTSSI?IYAC^^ 

HDQHCQVHGNAHVASCDAJMTRCLAVHECFVKRVDW 

VKSALLADKFPVLHDIGNP]^^ 

FTDGVC LFWNCNVDRYPANAIVCRFDTRVL SNLNL PGCDGG SL YVNKHAFHT PAFDKS AF 
TNLKQLPFFYYSDSPCESHGKQVVSDIDYVPIjKS 

AYNMMI SAGFSLWIYKQFDTYNLWNTFTRIjQSL^ I INN 

AVYTKVIXSIDVEIFENKTTLPVNVAFELWAKRNIKPV^ 
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REAPAHVSTIGVCTMTDIAKKPTESAC S SLTVLFIXSRVEGQVDLFRNAKNGVIiITEGSVK 
GLTPSKGPAQASVNGVTLIGESVKTQFNYFKKVDGI IQQLPETYFTQSRDLEDFKPRSQM 
ETDFLELAMDEFIQRYKLEGYAFEHIVYGDFSHGQ 
PMDSTVKNYFITDAQTGSSKCVCSV^ 

MLWCKIX5HVETFYPKI^ASRAWQPGVAMPNIjYKM PKGIMMN 
VAKYTQLCQYIiNTIjTIA^ 

VSDAYSTLIGIXLkTVHTANKWDLI I SDMYDPRTKHVTKENDSKEX5FFTYIjCGFIKQKLAL 

GGSIAVKITEHSWNADLYIOiMGHFSWWTAFVTNVl^ 

MHAJSTYIFWRNTNPIQLSSYSLFDMSKFPLKIiRGTAVM 

ENNRVVVSSDILVNN-TNMFIFLLFLTLTSGSDIjDRCTTFDDVQAPNYTQOT 

PDE I FRSDTL YLTQDLFLPFYSNVTGFHT INHTFGNPVT PFKDG I YFAATEKSNWRGWV 

FGSTMNimSQSVIIimSTlWVIRACNFEI^ 

YI SDAFSIiDVSEKSGOTKHIiREFVFKNKIX5FLYVYKGYQ 

PLGINITlSrFRAILTAFSPAQDIWGTSAAAYFVGYIiKPTTFM^ 

LAELKCSVKSFEIDKGIYQTSNF^^ 

KKI SNCVADYSVL YNSTFF STFKCY GV S ATKLNDLC F SNVYADS FWKGDDVRQ I APGQT 
GVIADYNYKLPDDFMGCVLAWNTRNIDATSTGl^^ SNVPFSP 
IX5KPCTPPALNCYWPLNDYG 

QCVNFNFNGLTGTGVLTPSSKEIFQPFQQFGRDVSDFTDSVRDPKTS 
VITPGTMASSEVAVLYQDVNCTDVSTAIH^ 
OTSYEODIPIGAGIC^SYHTVSIiIiRSTSQKSI^^ 
SITTEVMPVSMAKTSVIXS^ 

VFAQVKQMYKTPTLKYFGGFNFSQILPDPLKPTKRSFIEDLLFNKV^ 

LGDINARDLICAQKFNGLTVLPPLLTDDMIAAYTAALVSGTATAGWTFGAGAAL PFAM 

QMAYRFNG I GVTQNVL YENQKQ I ANQFNKAI S Q I QE S LTTT S T ALiGKLQDWNQNAQAIjN 

TLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRL^ 

SANLAATKMSECVIXSQSKRVDFCGKGYHLMSFPQAAPHGVV^ 

ICHEGKAYFPREGVFVFNGTSWFITQRNFFSPQIIT 

LQPELDSFKEEIJDKYFKNHTSPDVDLGDISGINASV^ 

QELGKYEQYIKWPWYVWLGFIAGLIAIVMVTILLCC^ 

SEPVLKGVKLHYT-TNLWICL-DFLL^^ 

IXjLLLALHFLLFFRALPK-LRSIKIXSS-PFIRASSSFAIYCCYLLPSIHIFCDSLQVWRR 

NFCTSMP-YIFYNASTHVEL^ 

QIQLSLLKVTAFQHQNSKKTTK]^^ 

VLKMLHSSSLTSI/LKTHRMCKYTQSTALQEM 

CTHSFRKKQVR — LIAYFFFLLSV^SC-SH-PSIiliRFDCVRTAAIIiLT-V — NQRFTSTR 
VKKI -TLLKEFLIFWSKRTNYYYYSVWNFNIAYHGRQRYYYR-GA-TTPGTMEPSNR 
PSLDYVTTICLF-SEQVFVHNI^CFPIiALVASOT 

RLDVA-LLRCFLQAVCSYPIiNVVIQPRNIQISSQCASPGDNCDQTAHGK-TCHWCC 
LANGRTLPRAL-H-GPAKRDHCGYITNAFLLQIRSVAACRH-FRFCCIQPLPYWKL- 
RPRR-QRQYCFASTVSDNRCFILLTSRLQ-QRY-LSLi-GLSGLLFGIIiTIi — VQ — DNYL 
SL-IiRRI IRS -MMKNIiWS - 1 IHKTNMKI ILFLTIilWTSCEIiYHYQEC^GTTVLIiKEPC 
PSGTYEGNSPFHPLADNKFALTCTSTHFAFACADGTRHTYQIjRARSV 
ELYSPLFLIVAALVFLILCFTIKRKTE-MSSL-IiTSICAF--PFCYSLF — CLLYFGFHSK 
SRI-KNLVPKSKRT-NFSLF-LVFLYAVAYAL-YSAVHLIOTjM 

AWIjC ALGKVTj PFHRWHTMVQTCTPNVT INC QDPAGGAL I ARCWYLHEGHQTAAFRDVLVV 
LNKRTN-NV — WTP I KPT -CP PHYIWWTHRFN- Q - PEWRTQWGKAKTAPTPRFTQ - YCVL 
VHSSHSAWQGGT-IPSRPGRSNQHQ-WSR-Pisn^^ 

PRNWPRSFTSLRR-QRRHRMGCN-GSLEYTQRPHWHPQS — QCCHRATTSSRNNIAKRLIi 

RRGKQRRQSSLFSLLIT-SR-FKKFNSWQQ-GKFSCSNG-RRW-NCPRAIAARQIEPA-E 

QSFW-RPTTTRPNCH-EICC-GI-KASPKTYCHK^ 

QTRN-LQTLAANCTICSKCIiC^ 

AEQAH-RIQNIPTNRA-KGQKEKD — S SAFAAETKEAAHCDS S SCG - HG- FLQTTSKFHE 
WSFC - FNSGINTHDDHTRQMGYVNVFAI PFT I HS LLLCRMNSRN-TAQVGLVNFNLT - QS 
LINV-H-GGLERATTFSSRPRGVRSRVQ-IMLGRAAYMEEP-CVKLILVV^ 
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FIGURE 25 

PTPRTLDAVCGCINVFKRVCGVSAARLTPC^ 

KDEEGNLLDSYFWKRHTMSNYQHEET^ 

ADLVYALRHFDEGNCiyriiKEILVTYN 

QFCDAMRDAG WGVLTLDNQDLNGNVi^^ ILTLTRALAAESH 
MDADLAKPLIKWDLLKYD 

PTSFGPLVRKIFVIX?VPFWSTGYHFRELGVVH^ 

LDKRTTCFSVAALTNNVAFQTVKPGNFNKDFTO 

YYRYNLPTMCDIRQLLFVVEWDKYFIX^ 

YEDQDALFAYTKRNVT PTI TQMNLKYAI SAKNRARTVAGVS I C STMTNRQFHQKLLKS IAATRGAT 
WIGTSKFYGGWHNMLKTVYSDVETP 

YRLAl^CAQVLSEMVMCGGSLYVKPGGTSSGDAOTAYANSVFNICQA 
YVRNLQHRLYECLYRNRDVDHEFVDEFYAYL^ 
VLYYQNNVFMSEAKCWTBTDLTKGPHEFCSQ 
IXjTLMIERFVSLAIDAYPLTKHPNQEYADVFH^ 

EPEFYEAMYTPHTVLQAVGACVLC^ STSHKLVLSVNPYVC 
NAPGCDVTDVTQLYLGGMSYYCKSHKPPI SFPLCANGQVFGLYKNTCVGSDNVTDF 
AGDYILANTCTERLKLFAAETL 
TGYRVTKNSKVQIGEYTFEKGDYGDAWYRGTT^ 

ITGLYPTLNISDEFSSNVAl^QKVGMQKYSTLQGPPGTGKSHFAIGI^ 

DALCEKALKYLPIDKCSRIIPARARVECFD^ 

NTYDLSVVNARLRAKHYVYIGDPAQLPA 

ErVDWSALVYDl^KAHKDKSAQCFKMFYKGVITro 

SPYNSQNAVASKILGLPTQTVDSSQGSEYDYVTFTQTTETAHSC^ 

RDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKIITGLHPTQAPTHLSVD 

GIPIODMTYRRLISMMGFKM^ ■ 

GFSTGVNLVAVPTGYVDTENNTO 

GLSDRWFVLWAHGFELTSMKYFVKIG 

IDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCIiAVHECFVK^ 
ACRKVQHMVVKSALLADKFPVIiHDIGNPKAI^ 

HDKFTIXJVCLFWNCNVDRYPAN^ • 
KQLPFFYYSDSPCESHGKQWSDIDYVPLKSATCITRC^ 

FSLWIYKQFOTYIWjWNTFTRLQSLEN^ I INNAVYTKVDGIDVEIFE 

NKTTLFVWAFELWAKITOIK^^ 

PTESACSSLTVLFDGRVEGQVDLFRNARNGVL^ 

NYFKKVDGI IQQLPETYFTQSRDLEDFKPRSQMETDFLELAMDEF IQRYKLEGYAFEHIVYGDFSH 

GQLGGLHLMIGLAKRSQDSPLKLEDFIPMD^ 

QDLSVISKVVKVTIDYAEISFMLWCKDGHVETFYPKLQ 

NYGENAVIPKGIMMWAICfTQLCQYLNTLTLA^ 

LVDSDLNDFVSDAYSTLIGIXIATVHTANKWDLIISDMYDP 

LALGGSIAVKITEHSWNADLYKI^ 

NYIFWRm^IQLSSYSLFDMSKFPLKLRGTATO^ 

DILVNN^TNMFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDEI 

LFLPFYSNVTGFHTINHTFGNPVI PFKDGI YFAATEKSNVVRGWVFGSTMNNKSQ SVT I INNSTNV 

VIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCTFEYISDAFSLDV^ 

KDGFLYVYKGYQPIDVVRDLPSGFNTLKPIFKLPL^ 

YLKPTTFMLKYDENGTITDAVD^ 

LCPFGEVFNATKFPSOTAWERKKISNC^ADYSV^^ 

WKGDDWQIAPGQTGVIADYNYKLPDDFMGCVIAWOT 

DISlWPFSPIXSKPCTPPALNCrZWPLNDYGFYT^ 

IKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTO 

PGTNASSEVAVLYQDWCTDVSTAIHADQLTPAWRIYSTGNNVFQTQ 

IGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSS IAYSNNTIAI PTNFSI S ITTEVMPVSMAKTS 

VDCNMYICGDSTECAl^LLQYGSFCTQLNR^ 

FSQILPDPLKPTKRSFIEDLLFNKVTLADAGFMKQY^^ 
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DMIAAYTAALVSGTATAGWTFGAGAAI^I PFAMQMAYRFNGIGVTQl^YENQKQIANQFNKAI SQ 

IQESLTTTSTALGKLQDVVNQNAQAIx^TTLVKQLSSNFGAISSVLNDI 

RLQSLOTTVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKG^ 

TYVPSQERNFTTAPAIOTEGKAYFPRBGVFVFNGT^ 

I INNTVYDPI^PEIJDSFKEErJ)KYFKNHTSPDVDI^^ 

IDLQELGK!fEQYIKWPWYVVnjGFIAGLIArVMVTILLGCMTSCCSCLKGA 

VLKGVKLHYT* 



FIGURE 26 



FIGURE 26A 



FIGURE 26B 
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FIGURE 27 
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FIGURE 28 
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FIGURE 31 
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FIGURE 35 
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FIGURE 36A 
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FIGURE 36C 
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FIGURE 37 
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FIGURE 39 

FIGURE 39A FIGURE 39B 
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FIGURE 40 
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FIGURE 41 
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FIGURE 42 
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FIGURE 43 
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FIGURE 46 
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FIGURE 48 
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FIGURE 50 
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f) Leader(NadA)-HR1 -intermediate -domain-HR2-anchor(NadA) 

g) Leader(NadA)-HRlHntennedialB-domain-HR2 



FIGURE 51 




(control) +MF59 SARS + MF59 
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A, 293 cefl lysates 



FIGURE 52 

B . COS7 cefl culture supematants 
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FIGURE 54 
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Xbal Blnl 



Isolate Xbal-Blnl 2440 bp fragment 
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FIGURE 59 




Isolate Agel-Sall 
f ragment ' 



1943 bp 



Sail 



LigateintopS93-l 
Xbal-Sall vector 



Ligate with HindUI- 
Agel Sl-l+Sl-2 oligos 
into pSP72 HiniUTT- 
Sall subcloning vector. 



pSP72SARS 
Spike SI #2 



FIGURE 60 




Isolate the Hindffl- 
SaU 1973 bp fragment 



BamHI 



1365 bp 



fiindm 



Hindm 



pikfeSl 
1973 bp 



Sail 



Ligate ADH2/GAPDH Promoter BamHI- 
Hindm + Spike SI HindHI-SaU fragments in 
pBS24.1 BamHtSall vector 
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FIGURE 61 




Isolate BlntDral 
1056 bp fragment 

68 bp 

mmmmMm + ■ 

BM 1065 bp Dral Dial Sail 

/Ligate fragment + kinase d DRS1+2 
oUgos into pT7Blue2 BmtSa II vector 




Xbal 3109 bp PcH Pcil 457 bp Sail 



ligate Xbal-Pcil 31 09 bp + PcO-Sall 457 bp 
fragments into p893-l Xbal-Sall vector 
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FIGURE 63 



Isolate Agel-Blnl 2416 bp 
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Hindm Agel 




Isolate Ncol-Blnl 2044 bp 
fragment 
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2044 bp 



BtDl 



Ugate Hincffll-Agel Sl-l+Sl-2 oligos 
+ Agel-Blnl 1195 fragment into 
pT7Blue2 HimUQ-Blnl subc toning 
vector 



f pT7Blue2JSARS) 
1195 5'HindIII- 
Blnl#10 



Hindm 



Ncol 
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BamHI 



1163 bp 



Hindm Ncol 

+ urn + 1 

402 bp 



2044 bp 



9 + 

Blnl 



1126 bp SaU 



Ligate Bamm-Hindffl ADH2/GAPDH promoter, 
Hindm-Ncol, Ncol-Blnl, and Blnl-Sall fragments 
into pBS24.1 BamHI-Sall yeast expression vector. 
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FIGURE 65 

1 10 
MSDLDRCTTF 
AAGCTTACAAAACAAA ATG AGT GAC CTT GAC CGG TGC ACC ACT TTT 

20 

DDVQAPNYTQHTSSM 
GAT GAT GTT CAA GCT CCT AAT TAC ACT CAA CAT ACT TCA TCT ATG 

30 40 
RGVYYPDEIFRSDTL 
AGG GGG GTT TAC TAT CCT GAT GAA ATT TTT AGA TCA GAC ACT CTT 

50 

YLTQDLFLPFYSNVT 
TAT TTA ACT GAG GAT TTA TTT CTT CCA TTT TAT TCT AAT GTT ACA 

60 70 
GFHTINHTFGNPVIP 
GGG TTT CAT ACT ATT AAT CAT ACG TTT GGC AAC CCT GTC ATA CCT 

80 

FKDGIYFAATEKSNV 
TTT AAG GAT GGT ATT TAT TTT GCT GCC ACA GAG AAA TCA AAT GTT 

90 100 
VRGWVFGSTMNNKSQ 
GTC CGT GGT TGG GTT TTT GGT TCT ACC ATG AAC AAC AAG TCA CAG 

110 

SVI I INNSTNVVIRA 
TCG GTG. ATT ATT ATT AAC AAT TCT ACT AAT GTT GTT ATA CGA GCA 

120 130 
CNFELCDNPFFAVSK 
TGT AAC TTT GAA TTG TGT GAC AAC CCT TTC TTT GCT GTT TCT AAA 

140 

PMGTQTHTMI FDNAF 
CCC ATG GGT ACA CAG ACA CAT ACT ATG ATA TTC GAT AAT GCA TTT 

150 160 
NCTFEYISDAFSLDV 
AAT TGC ACT TTC GAG TAC ATA TCT GAT GCC TTT TCG CTT GAT GTT 

170 

SEKSGNFKHLREFVF 
TCA GAA AAG TCA GGT AAT TTT AAA CAC TTA CGA GAG TTT GTG TTT 

180 190 
KNKDGFLYVYKGYQP 
AAA AAT AAA GAT GGG TTT CTC TAT GTT TAT AAG GGC TAT CAA CCT 
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200 

IDVVRDLPSGFNTLK 
ATA GAT GTA GTT CGT GAT CTA CCT TCT GGT TTT AAC ACT TTG AAA 

210 220 
PIFKLPLGINITNFR 
CCT ATT TTT AAG TTG CCT CTT GGT ATT AAC ATT ACA AAT TTT AGA 

230 

AILTAFSPAQDIWGT 
GCC ATT CTT ACA GCC TTT TCA CCT GCT CAA GAC ATT TGG GGC ACG 

240 250 
SAAAYFVGYLKPTTF 
TCA GCT GCA GCC TAT TTT GTT GGC TAT TTA AAG CCA ACT ACA TTT 

260 

MLKYDENGTITDAVD 
ATG CTC AAG TAT GAT GAA AAT GGT ACA ATC ACA GAT GCT GTT GAT 

270 280 
CSQNPLAELKCSVKS 
TGT TCT CAA AAT CCA CTT GCT GAA CTC AAA TGC TCT GTT AAG AGC 

290 

FEIDKGIYQTSNFRV 
TTT GAG ATT GAC AAA GGA ATT TAC CAG ACC TCT AAT TTC AGG GTT 

300 310 
VPSGDVVRFPNITNL 
GTT CCC TCA GGA GAT GTT GTG AGA TTC CCT AAT ATT ACA AAC TTG 

320 

CPFGEVFNATKFPSV 
TGT CCT TTT GGA GAG GTT TTT AAT GCT ACT AAA TTC CCT TCT GTC 

330 340 
YAWERKKISNCVADY 
TAT GCA TGG GAG AGA AAA AAA ATT TCT AAT TGT GTT GCT GAT TAC 

350 

SVLYNSTFFSTFKCY 
TCT GTG CTC TAC AAC TCA ACA TTT TTT TCA ACC TTT AAG TGC TAT 

360 370 
GVSATKLNDLCFSNV 
GGC GTT TCT GCC ACT AAG TTG AAT GAT CTT TGC TTC TCC AAT GTC 

380 

YADSFVVKGDDVRQ I 
TAT GCA GAT TCT TTT GTA GTC AAG GGA GAT GAT GTA AGA CAA ATA 

390 400 
APGQTGVIADYNYKL 
GCG CCA GGG CAA ACT GGT GTT ATT GCT GAT TAT AAT TAT AAA TTG 



WO 2004/092360 



114/193 



PCT/DS2004/011710 



410 

PDDFMG CVLAWNTRN 
CCA GAT GAT TTC ATG GGT TGT GTC CTT GCT TGG AAT ACT AGG AAC 

420 430 
IDATSTGNYNYKYRY 
ATT* GAT GCT ACT TCA ACT GGT AAT TAT AAT TAT AAA TAT AGG TAT 

440 

LRHGKLRPFERDISN 
CTT AGA CAT GGC AAG CTT AGG CCC TTT GAG AGA GAC ATA TCT AAT 

450 460 
VPFSPDGKPCTPPAL 
GTG CCT TTC TCC CCT GAT GGC AAA CCT TGC ACC CCA CCT GCT CTT 

470 

NCYWPLNDYGFYTTT 
AAT TGT TAT TGG CCA TTA AAT GAT TAT GGT TTT TAC ACC ACT ACT 

480 490 
GIGYQPYRVVVLSFE 
GGC ATT GGC TAC CAA CCT . TAC AGA GTT GTA GTA CTT TCT TTT GAA 

500 

LLNAPATVCGPKLST 
CTT TTA AAT GCA CCG GCC ACG GTT TGT GGA CCA AAA TTA TCC ACT 

510 520 
DLIKNQCVNFNFNGL 
GAC CTT ATT AAG AAC CAG TGT GTC AAT TTT AAT TTT AAT GGA CTC 

530 

TGTGVL TP SSKRFQ P 
ACT GGT ACT GGT GTG TTA ACT CCT TCT TCA AAG AGA TTT CAA CCA 

540 550 
FQQFGRDVSDFTDSV 
TTT CAA CAA TTT GGC CGT GAT GTT TCT GAT TTC ACT GAT TCC GTT 

560 

RDPKTSEILDISPCS 
CGA GAT CCT AAA ACA TCT GAA ATA TTA GAC ATT TCA CCT TGC TCT 

570 580 
FGGVSVITPGTNAS S 
TTT GGG GGT GTA AGT GTA ATT ACA CCT GGA ACA AAT GCT TCA TCT 

590 

EVAVLYQDVNCTDVS 
GAA GTT GCT GTT CTA TAT CAA GAT GTT AAC TGC ACT GAT GTT TCT 

600 610 
TAIHADQLTPAWRIY 
ACA GCA ATT CAT GCA GAT CAA CTC ACA CCA GCT TGG CGC ATA TAT 
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620 

STGNNVFQTQAGCLI 
TCT ACT GGA AAC AAT GTA TTC CAG ACT CAA GCA GGC TGT CTT ATA 

630 640 
GAEHVDT SYECDIPI 
GGA GOT GAG CAT GTT GAT ACT TCT TAT GAG TGC GAC ATT CCT ATT 

650 

G, AGICASYHTOC SEQ ID NO: 9799 

GGA GCT GGC ATT TGT GOT AGT TAC CAT ACA TAA TGAGTCGAC SEQ XD NO: 9800 

Translated Mol. Weight = 72525.52 



FIGURE 66 

1 10 
MSDLDRCTTF 
AAGCTTACAAAACAAA ATG AGT GAC CTT GAC CGG TGC ACC ACT TTT 

20 

DDVQAPNYTQHTSSM 
GAT GAT GTT CAA GCT CCT AAT TAC ACT CAA CAT ACT TCA TCT ATG 

30 40 
RGVYYPDEIFRSDTL 
AGG GGG GTT TAC TAT CCT GAT GAA ATT TTT AGA TCA GAC ACT CTT 

\ 50 
Y ' 2i T Q D .L F L P F Y S N V T 
TAT TTA ACT CAG GAT TTA TTT CTT CCA TTT TAT TCT AAT GTT ACA 

60 70 
GFHTINHTFGNPVIP 
GGG TTT CAT ACT ATT AAT CAT ACG TTT GGC AAC CCT GTC ATA CCT 

80 

FKDGIYFAATEKSNV 
TTT AAG GAT GGT ATT TAT TTT GCT GCC ACA GAG AAA TCA AAT GTT 

90 100 
VRGWVFGSTMNNKSQ 
GTC CGT GGT TGG GTT TTT GGT TCT ACC ATG AAC AAC AAG TCA CAG 

110 

SVIIINNSTNVVIRA 
TCG GTG ATT ATT ATT AAC AAT TCT ACT AAT GTT GTT ATA CGA GCA 

120 130 
CNFELCDNPFF AVSK 
TGT AAC TTT GAA TTG TGT GAC AAC CCT TTC TTT GCT GTT TCT AAA 

140 

PMGTQTHTMIFDNAF 
CCC ATG GGT ACA CAG ACA CAT ACT ATG ATA TTC GAT AAT GCA TTT 

150 160 
NCTFEYI SDAFSLDV 
AAT TGC ACT TTC GAG TAC ATA TCT GAT GCC TTT TCG CTT GAT GTT 
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170 

SEKSGNFKHLREFVF 
TCA GAA AAG TCA GGT AAT TTT AAA CAC TTA CGA GAG TTT GTG TTT 

180 190 
KNKDGFLYVYKGYQP 
AAA AAT AAA GAT GGG TTT CTC TAT GTT TAT AAG GGC TAT CAA CCT 

200 

IDVVRDLPSGFNTLK 
ATA GAT GTA GTT CGT GAT CTA CCT TCT GGT TTT AAC ACT TTG AAA 

210 220 
PIFKLPLGINITNFR 
CCT ATT TTT AAG TTG CCT CTT GGT ATT AAC ATT ACA AAT TTT AGA 

230 

AILTAFSPAQDIWGT 
GCC ATT CTT ACA GCC TTT* TCA CCT GOT CAA GAC ATT TGG GGC ACG 

240 250 
SAAAYFVGYLKPTTF 
TCA GCT GCA GCC TAT TTT GTT GGC TAT TTA AAG CCA ACT ACA TTT 

260 

M LKYDENGT I TDAVD 
ATG CTC AAG TAT GAT GAA AAT GGT ACA ATC ACA GAT GCT GTT GAT 

270 280 
CSQNPLAELKCSVKS 
TGT TCT CAA AAT CCA CTT GCT GAA CTC AAA TGC TCT GTT AAG AGC 

290 

FE/IDKGIYQTSNFRV 
TTT GAG ATT GAC AAA GGA ATT TAC CAG ACC TCT AAT TTC AGG GTT 

300 310 
VPSGDVVRFPNITNL 
GTT CCC TCA GGA GAT GTT GTG AGA TTC CCT AAT ATT ACA AAC TTG 

320 

CPFGEVFNATKFPSV 
TGT CCT TTT GGA GAG GTT TTT AAT GCT ACT AAA TTC CCT TCT GTC 

330 340 
YAWERKKI SNCVADY 
TAT GCA TGG GAG AGA AAA AAA ATT TCT AAT TGT GTT GCT GAT TAC 

350 

SVLYNSTFFSTFKCY 
TCT GTG CTC TAC AAC TCA ACA TTT TTT TCA ACC TTT AAG TGC TAT 

360 370 
GVSATKLNDLCFSNV 
GGC GTT TCT GCC ACT AAG TTG AAT GAT CTT TGC TTC TCC AAT GTC 

380 

YADSFVVKGDDVRQI 
TAT GCA GAT TCT TTT GTA GTC AAG GGA GAT GAT GTA AGA CAA ATA 

390 400 
APGQTGVIADYNYKL 
GCG CCA GGG CAA ACT GGT GTT ATT GCT GAT TAT AAT TAT AAA TTG 
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410 

PDDPMGCVLAWNTRN 
CCA GAT GAT TTC ATG GGT TGT GTC CTT GCT TGG AAT ACT AGG AAC 

420 430 
IDAT STGNYNY KYRY 
ATT GAT GCT ACT TCA ACT GGT AAT TAT AAT TAT AAA TAT AGG TAT 

440 

LRHGKLRPFERDISN 
CTT AGA CAT GGC AAG CTT AGG CCC TTT GAG AGA GAC ATA TCT AAT 

450 460 
VPFSPDGKPCTPPAL 
GTG CCT TTC TCC CCT GAT GGC AAA CCT TGC ACC CCA CCT GCT CTT 

470 

NCYWPLNDYGFYTTT 
AAT TGT TAT TGG CCA TTA AAT GAT TAT GGT TTT TAC ACC ACT ACT 

480 490 
GI GYQPYRVVVLSFE 
GGC ATT GGC TAC CAA CCT TAC AGA GTT GTA GTA CTT TCT TTT GAA 

500 

LLNAPATVCGPKLST 
CTT TTA AAT GCA CCG GCC ACG GTT TGT GGA CCA AAA TTA TCC ACT 

510 520 
DLIKNQCVNFNFNGL 
GAC CTT ATT AAG AAC CAG TGT GTC AAT TTT AAT TTT AAT GGA CTC 

530 

TGTGVLTPSSKRFQP 
ACT GGT ACT GGT GTG TTA ACT CCT TCT TCA AAG AGA TTT CAA CCA 

540 550 
FQQFGR DVSDFTDSV 
TTT CAA CAA TTT GGC CGT GAT GTT TCT GAT TTC ACT GAT TCC GTT 

560 

RDPKTSEILD ISPCS 

CGA GAT CCT AAA ACA TCT GAA ATA TTA GAC ATT TCA CCT TGC TCT 

•» 

570 580 
FGGVSVITPGTNASS 
TTT GGG GGT GTA AGT GTA ATT ACA CCT GGA ACA AAT GCT TCA TCT 

590 

EVAVLYQDVNCTDVS 
GAA GTT GCT GTT CTA TAT CAA GAT GTT AAC TGC ACT GAT GTT TCT 

600 610 
TAIHADQLTPAWRIY 
ACA GCA ATT CAT GCA GAT CAA CTC ACA CCA GCT TGG CGC ATA TAT 

620 

STGNNVFQTQAGCLI 
TCT ACT GGA AAC AAT GTA TTC CAG ACT CAA GCA GGC TGT CTT ATA 

630 640 
GAEHVDTSYECDIPI 
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GGA GCT GAG CAT GTC GAC ACT TCT TAT GAG TGC GAC ATT CCT ATT 

650 

GAGICASYHTVSLLR 
GGA GCT GGC ATT TGT GCT AGT TAC CAT ACA GTT TCT TTA TTA CGT 

660 670 
STSQKSIVAYTMSLG 
AGT ACT AGC CAA AAA TCT ATT GTG GCT TAT ACT ATG TCT TTA GGT 

680 

ADSSIAYSNNTIAIP 
GCT GAT AGT TCA ATT GCT TAC TCT AAT AAC ACC ATT GCT ATA CCT 

690 700 
TNFSISITTEVMPVS 
ACT AAC TTT TCA ATT AGC ATT ACT ACA GAA GTA ATG CCT GTT TCT 

710 

MA KT SVDCNMYI CGD 
ATG GCT AAA. ACC TCC GTA GAT TGT AAT ATG TAC ATC TGC GGA GAT 

720 730 
STECANLLL QYGSFC 
TCT ACT GAA TGT GCT AAT TTG CTT CTC CAA TAT GGT AGC TTT TGC 

740 

TQLNRALSGIAAEQD 
ACA CAA CTA AAT CGT GCA CTC TCA GGT ATT GCT GCT GAA CAG GAT 

750 760 
RNTREVFAQVKQMYK- 
CGC AAC ACA CGT GAA GTG TTC GCT CAA GTC AAA CAA ATG TAC AAA 

770 

T P T L KY F G G FNF S Q I 
ACC CCA ACT TTG AAA TAT TTT GGT GGT TTT AAT TTT TCA CAA ATA 

780 790 
LPDPLKPTKRSFIED 
TTA CCT GAC CCT CTA AAG CCA ACT AAG AGG TCT TTT ATT GAG GAC 

800 

LLFNKVTLADAGFMK 
TTG CTC TTT AAT AAG GTG ACA CTC GCT GAT GCT GGC TTC ATG AAG 

810 820 
QYGECIiGDINARDLI 
CAA TAT GGC GAA TGC CTA GGT GAT ATT AAT GCT AGG GAC CTC ATT 

830 

CAQKFNGLTVLPPLL 
TGT GCG CAG AAG TTC AAT GGA CTT ACA GTG TTG CCA CCT CTG CTC 

840 850 
TDDMIAAYTAALV SG 
ACT GAT GAT ATG ATT GCT GCC TAC ACT GCT GCT CTA GTT AGT GGT 

860 

TATAGWTFGAGAALQ 
ACT GCC ACT GCT GGA TGG ACA TTT GGT GCT GGC GCT GCT CTT CAA 



870 



880 
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IPFAMQMAYRFNGIG 
ATA CCT TTT GCT ATG CAA ATG GCA TAT AGG TTC AAT GGC ATT GGA 

890 

VTQNVLYENQKQ IAN 
GTT ACC CAA AAT GTT CTC TAT GAG AAC CAA AAA CAA ATC GCC AAC 

900 910 
QFNKAISQIQESLTT 
CAA TTT AAC AAG GCG ATT AGT CAA ATT CAA GAA TCA CTT ACA ACA 

920 

TSTALGKLQDVVNQN 
ACA TCA ACT GCA TTG GGC AAG CTG CAA GAC GTT GTT AAC CAG AAT 

930 940 
AQALNTLVKQLSSNF 
GCT CAA GCA TTA AAC ACA CTT GTT AAA CAA CTT AGC TCT AAT TTT 

950 

GAISS VIiNDIL SRLD 
GGT GCA ATT TCA AGT GTG CTA AAT GAT ATC CTT TCG CGA CTT GAT 

960 970 
KVEAEVQIDRLITGR 
AAA GTC GAG GCG GAG GTA CAA ATT GAC AGG TTA ATT ACA GGC AGA 

980 

IiQSLQTYVTQQLIRA 
CTT CAA AGC CTT CAA ACC TAT GTA ACA CAA CAA CTA ATC AGG GCT 

990 1000 
AEIRASAN LAATKMS 
GCT GAA ATC AGG GCT TCT GCT AAT CTT GCT GCT ACT AAA ATG TCT 

1010 

ECVLGQSKRVDFCGK 
GAG TGT GTT CTT GGA CAA TCA AAA AGA GTT GAC TTT TGT GGA AAG 

1020 1030 
GYHLMS F PQAA P HGV 
GGC TAC CAC CTT ATG TCC TTC CCA CAA GCA GCC CCG CAT GGT GTT 

1040 

VFLHVTYVPSQERNF 
GTC TTC CTA CAT GTC ACG TAT GTG CCA TCC CAG GAG AGG AAC TTC 

1050 1060 
TTAPAI CHEGKAYFP 
ACC ACA GCG CCA GCA ATT TGT CAT GAA GGC AAA GCA TAC TTC CCT 

1070 

REGVFVFNGTSWFIT 
CGT GAA GGT GTT TTT GTG TTT AAT GGC ACT TCT TGG TTT ATT ACA 

1080 1090 
QRNFFSPQIITTDNT 
CAG AGG AAC TTC TTT TCT CCA CAA ATA ATT ACT ACA GAC AAT ACA 

1100 

FVSGNCDVVIGI INN 
TTT GTC TCA GGA AAT TGT GAT GTC GTT ATT GGC ATC ATT AAC AAC 
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1110 1120 
TVYDPLQPELDSFKE 
ACA GTT TAT GAT CCT CTG CAA CCT GAG CTT GAC TCA TTC AAA GAA 

1130 

ELDKYPKNHTSPDVD 
GAG CTG GAC AAG TAC TTC AAA AAT CAT ACA TCA CCA GAT GTT GAT 

1140 1150 
FGDISGINASVVNIQ 
TTT GGC GAC ATT TCA GGC ATT AAC GCT TCT GTC GTC AAC ATT CAA 

1160 

KEIDRLNEVAKNLNE 
AAA GAA ATT GAC CGC CTC AAT GAG GTC GCT AAA AAT TTA AAT GAA 

1170 1180 
SLIDLQELGKYEQYI 
TCA CTC ATT GAC CTT CAA GAA TTG GGA AAA TAT GAG CAA TAT ATT 

1183 

K W P OC SEQ ID HO: 9801 

AAA TGG CCT TAA TGAGTCGAC SEQ ID NO: 9802 

Translated Mol. Weight = 131315. 20 ' 



FIGURE 67 

FIGURE 67 A 




FIGURE 67B 



rt sore -rare 



(Mia) 

669 ► 
440 ► 



Tinner iggj 



232 



Bimer? j t;.- t 



15ft ► 



Monomer > , ^ ^ 



WO 2004/092360 



PCT/US2004/011710 



121/193 
FIGURE 68 



< 

CO 
□ 
Ul 

c 

I 

o 

* 



140000 -l 
120000 
100000 

80000 

60000 

40000 

20000 



ill 





1 2 
FIGURE 69 




tW&G5 



FIGURE 70 




WO 2004/092360 



122H93 



PCT/DS2004/011710 



1000 T- 



FIGURE 71 



N 



3 
z 



100 



to 



»v ' ■ if 




nSdTC: protein 



rtStfTC DN*DNAPLG,DNAPliQ 



FIGURE 72 

10000 t — 




WO 2004/092360 



123/193 



PCT/US2004/D11710 



FIGURE 73 



CHO denes: 
250 ► 



F1F2 



64 ► 



CHOdaaes: 12 - 11 T2 X3 

(M>a) 
250 ► 



98* 

so* 





12 3 


1 2 3 4 5 








FIGURE 74 




FIGURE 74A 


L 




TMCy 




1 


SI S2 


I w$$?p. 
MBHB lii 



1 14 Receptor Binding 
Neutralization 



Oligomerization 
Membrane fusion 



FIGURE 74B 



SI 



S2 



1255 



1195 



FIGURE 75 



FIGURE 75A 



PNGaseF: - + 
(kDa) 

250 — 

150 — 
100—' 
75 — 

50 — 

35 — 

25 — 



+ - 



+ 



< 



FIGURE 75B 



1 2 3 4 5 6 



DTT : - - - 

80°C, 5min : + + + 

(kDa) 

250 — 

98 — 
64— 
50 — 

36 — 



12 3 



WO 2004/092360 



124/193 
FIGURE 76 



PCT/US2004/011710 









Endo H : 


- + 


+ 

• \ \ - : " .'v'/j.V.--.-'. 


gpl60 ► 




:.|(i^i:^^'<l gpl70 










1 2 


3 4 



FIGURE 77 



DTT: 
Boil : 

(kDa) 

' 669— 
440— 



+ + 

+ 



232 — 



158 — 



12 3 4 



FIGURE 78 



4? $ 



RT RT 80°C 100°C 
(kDa) ^|?^.'-Hf^^| 

669 — a£&$3atoffi 
• 440 -l^fpi^P'Jv 'kl 



232 
158 



2 3 4 



WO 2004/092360 



125/193 



PCT/US2004/011710 



FIGURE 79 

RT 80°C 100°C 




WO 2004/092360 



PCT/DS2004/011710 



126/193 

FIGURE 81 FIGURE 82 
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FIGURE 84 
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FIGURE 86 
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FIGURE 106 
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FIGURE 109 
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FIGURE 111 
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FIGURE 113 

5'3' Frame 1 

PKDMTYVDSSL-WS-ITKSMVTLICLSPA 

SS-DFLQVLT--LYRLVMLTmTQNSPELMHKPPPVSSLNILYHSCIKACPGM 

YKCSVIH-KDCQTESCSSFGRMMiSIJIQ-STLSRLDLKERWCVTW 

61 ILWVLTMSITHL-LMFSSGGFTGNLSE-P- PTLPGTWKCTCGLWML 



5>3> Frame 2 

QRT-PT-mYIXSFQNELPSQWLP-YraPRRSYSSRSC^WL-CRGLSCN-RCCGY-PT 
SPARimC-LSSCTDWLC-H-K-mHQS-CmHQ-AV-TSYTTHV-RLALECSAY-D 
STMQ-YTERIWQSRVRPLGAWL-AYINEVI£^ 
LESFCGF-LCL-PIYD-CSAVGALRVTFQSNHDQHCQVH6NAHVG-L-C 

5' 3' Frame 3 

KGHDLRRLISMGFKMNYQWGYPMFITREEAIRHVRAMGFDVEGCHATRDAVGT^ 
LQIXSFSTGVI^VAVPTGYVDTENNTKFTRVMOT 
VQI^SDTLKGLSDRVVFVLWMGFELTSMKYFVKIGPFJITCCLCDKRATC 
WNHSVGFDYVYNPFMIDVQQWGLYG- PFRVTMTNIARYMEMHMWASCDA 

3' 5" Frame 1 

-HHN-PTCAFPCTWQCWSWLL-KVTRKAPTA^ 

KLHVCHTDNTFFQVQS-QSTSLM-AQSHAPKGRTRLCLTILSVYH-AFVLS-YALHSRAS 
LYT-W-DV-TAHWVmFVH-LW-ILCYFQCQHNQSVQLLS-HL-KILAGEVG-YPQHL-L 
HDSPLHQSQSTHERDE-LLRG — TY-GNH-LGNSF-NPS-R-VYVGHVLW 

3*5" Frame 2 

SITTSPHVHFHVPGWGHGYSmmPPLI^IMIKWVIDIVKTHRMIPAGISI--SRKA 
SCTFVTOTITlSFRSNLDKVLH-CKLKAMRPKDFJroSV-QSFQ 

FIHEWYKMFKLLTGGGLCINSGEFCVIFSWITSRYSY-VNTCRKS-LER-VSTHSISSC 
MTALYIKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDEST-VMSF 

3*5' Frame 3 

ASQLAHMCISMYLAMLVMVTLKG^ 

VARLSMQHVLSGPILTRYFIDVSSKPCAQRTNTTLSDNPFSVSLSICTILIRTTFQGKP 
LYMSGIRCLNCSLWCALTLVNFVLFSVST-PVGTATKLTPVF^SMGPJjWTASLVA 
-QPSTSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLRRSCPL 
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FIGURE 114 

5'3' Frame 1 

YRIUjISMGFKMNYQVNCTPNMFIT^ 
FSTGmVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLI 

5'3' Frame 2 

TVDSSL-WSK-ITKSMVTLICLSPAKKLFVTFVRGLALM-RAV^ 
FLQVLT — LYRLVMLTLKITQNSPELMQNLHQVTSLNILY 

5*3' Frame 3 

P-THLYIXSFQNELPSQWLP-YVYHPRRSYSSRSC^/DWL-CRGLSCN-RCCGY-PTSPARI 
FYRC-LSSCTDWLC-H-K-HRIHQS-CKTSTR- PV-TSYT 

3'5' Frame 1 

GIRCLNWSPGGGFMjTLVNSVL^ 
TSKPIHART-RIASSRVINILG- PLTW-FILKPIIEMSLR 

3*5' Frame 2 

v-dv-tghlvevlh-lw-ilcyfqcqhnqsvqlls-hl-kilagevg-ypqhl-lhdspl 
hqsqstherde-llrg- -ty-gnh-lgnsf-nps-r-vyg 

3'5' Frame 3 

YKMFI&VTVMRFCINSGEFC^IFSm 
IKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDESTV 
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FIGURE 115 (contd.) 
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FIGURE 115 (contd.) 
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FIGURE 115 (contd.) 
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FIGURE 116 
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FIGURE 118 
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FIGURE 118 (contd.) 
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FIGURE 119 

tagtcaaaacccacagaatgattccagcaggcataagtatctgatgaagtagaaaagcaa 

-VSDEVEKQ 

gttgcacgtTTGtcacacagacaacacgttctttcaggtccaatcTTGacaaagtacttc 
V A R _L_ S HRQHVLSGP I _L_ T K Y F 

attgatgtaagctcaaagccatgcgcccaaaggacgaacacgactctgtctgacaatcct 
IDVSSKPCAQRTNTTLSDNP 

ttcagtgtatcactgagcatttgtactatcttaatacgcactacattccagggcaagcct 
FSVSLS ICTILIRTTFQGKP 

ttatacATGagtggtataagatgtttaaactgctcactggtggaggtttgtgcattaact 
L Y _M_ S 6IRCLNCSLVEVCALT 

Ctggtgaattttgtgttattttcagtgtcaacataa SEQ ID NO: 10080 
LVNFVLFSVST-SEQIDNO: 10027 



FIGURE 120 

FIGURE 120A 

PRHTQRT-PTVDSSL-WSK-ITKSMVTLICLSPAKKLFWFWGLALM-RAVMQ 

LEMLWVLTYLSS-DFLQVLT — LYRLVMLTLKITQNSPELMQMLHQVTSLNILYH 

SCIKACPGM-CYLR-YKCSVIH-KDCQTESCSSFGRMALSLHQ-STLSRLDIjKER 

WCVTWQLAFLmQILMPAGIILWVLTMSITHL-LMFSSGALRW 

RYMEMHMV^VVMLS-LDV-QSMSALLSALIGLIOTLL-EMN-GLILLAEKYMW. 

-SLHCLLISFQFFMT-EIQRLSSVCLRLK-NGSSTMLSHVWKLTK-RNSSIim 

YITINSLMVWCFGIWLIWQPMQLCVGLTQESCQT-TYQAVMVWOl-ISMHS 

TLQLSIKVHLLI-SNCLSFTILIVLVSLMANK-CRILIMFHSNLLRVLHDAI-W 

LFADTMQMSTDSTWMHII — FLLDLAYGFTNNLILITCGIHLPGYRV 

FIGURE 120B 

LGIPKGHDLP-THLYDGFQ]^LPSQWLP-YVYHPRRSYSSRSC!VDWL-CRGLSCN 
-RCCGY-PTSPARIFYRC-LSSCTDWLC-H-K-HRIHQS-CKTSTR-PV-TSYTT 
HV-RLALECSAY-DSTNAQ-YTERIWQSRWPLGAWL-AYINEVLCQDWT-KW 
LSV-QTCNLLFYFIRYLCLLESFCGF-LCL-PIYD-CSAVGLYG-PSE-P-PTLP 
GTWKCTCG-L-CTHD-MFSSP-VLC-AR-LVC-IPYYRR-TEG-FCLQKSTTHGC 
EVCIAC — VSSSS-HRKSKGYQVCASG-SRMEVLRCSAM — QSLQNRGTLLFLCY 
TSR-IH-WCLFVLEL-R-SLPSQCNCV-V-HKSLVKLELTRL-WW-FVCE-ACIP 
HSSFR-KCIY-FKAIAFLLLF — SL-VSWQTSSVGY-LCSTQICYVYYTMQFRWC 
CLQTPCK-VPTVLGCI -YDDFCWI -PMDLQTI-YL- PVEYIYQVTEF 
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FIGURE 120C 

-AYPKDMTYRRLI SMGFKMl^QWGYPNMFITREEMRHVRAWIGFDVEGCHATRDAVG 
TNLPLQLGFSTGVNLVAVPTGYVDTEl^ 

RIKIVQMLSDTLKGLSDRVVFVLWMGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSD 
TYACWNHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVH 

HECFVKRVDWSVEYPI IGDELRWSACRKVQHMVVKSALLADKFPVLHDIGNPKMKCVP 
QAEVEWKFYDAQPCSDKAYKIEELFYSYAIHHDKF^ 
TRVLSNI^PGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPC 
IDYVPLKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMI SAGFSLWIYKQFTDTYNLWNT 
FTRLQSL 

FIGURE 120D 

-tl-pgkcipqvisiklfvl^-mssrl^iiciqvlsvlicwsmstt-iascntrsr 

f™iimrhylfamrltrtirivkerqll-iskctfieswsvecmlihio^ittitaw-v 

qv-qds™pthncigwvtinvtipko™ , isefi^ 

lpfyfslrhtldslwi syvmknwkli skqcrlhnhvlyfsasrinpqfi s ynrvfnrpin 

alnkalmdc -tsshds ittshmc i smylamlvmvtij<vtrkapllninhkwidivkthr 

mi pagi si - - srkasctfwqttrsfrsnldkvlh-cklkamrpkdehdsv-qsfqc ite . 

hlyylnthyi pgqafihewykmfkl^ -vntcrks 

-LER-VSTHSISSCMTALYIKANPRTNVTNSFFAGDKHIRVTIDLVIHFETHHRDESTVG 
HVLWVCL 

FIGURE 120E 

KLCNLVNVFHRL-VSNCL- IHRLNPAEI I ILYASKYCRYSFAWCLQTAPPKLHRVIHVAD 
LSGT-SISDTTCLP-DSQGLSE- -KKGNCFKLVNALLSKAGVWNACLFTYKLPPSQPGKF 
KFDKTLVSNLHTIAIAG-RSTLQFQN^^ 

FHSTSA-GTHLIAFGFPMS-RTGNLSASNADFTTMCCTFLQAELTLSSSPIIGYSTDQST 
RLTKHSWAKHLVMIASQLATCAFPCTWQCWSWLL-RLPVKPHC-TSIINGL-T-SKPTE 
-FQQA-VSDEVEKQVMUjSHRQHVLSGPILTKYFIDVSSKPCAQRTNTTLSDNPFSVSLS 
ICTILIRTTFQGKPLYMSGIRCLMSPGGGFALTLVNSVLFSVST-PVGTATKLTPVENP 
SWRGRLVPTASLVA-QPSTSKPIHART-RIASSRVINILG-PLTW-FILKPIIEMSLR-V 
MSFGYA- 

FIGURE 120F 

NSVTW-MYSTGYKYQrVCKSIG-IQQKSSYYMHPSWGTHLHGVCKQHHLNCIV-YT-QI 
-VEHNQYPTLLVCHETHKDYQNSKRKAIALN — MHFYRKLECGMHAYSHTNYHHHSLVSS 
SLTRLLCQTYTQLHWLGNDQRYNSKTNKHHQ-IYRDV-HKM^ 

SILLQPEAHT — PLDFLCHEELETYQQAMQTSQPCWLFCKQN-PSVHLL — GIQQTNQR 
A-QSTHGLLNI -S — HHN- PHVHFHVPGNVGHGYSEGYP- SPTAEHQS-MGYRHSQNPQN 
DSSRHKYLMK-KSKLHVCHTDNTFFQVQS-QSTSLM-AQSHAPKGRTRLCLTILSVYH-A 
FVLS-YALHSRASLYT-W-DV-TGHLVEVLH-LW-ILCYFQCQHNQSVQLLS-HL-KIL 
AGEVG-YPQHL-LHDSPLHQSQSTHERDE-LLRG— TY-GNH-LGNSF-NPS-R-VYGRS 
CPLGMPR 
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FIGURE 121 
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I I I I I I 

SEQ ID NO: 10033 TACCGTAGACTCATCTCTATGATGGGTTTCAAAA 

SEQ ID NO: 10084 CCTAGGCATA(ICCAMGGACATGACCTA 

Consensus TACCCTAGACTCATCTCTATGATGGCTTTCAAAA 
Prim. cons. (X7TAGGCATA(X:CAAAGGACATGACmCCGTAGACTCATC^^ 
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I I I I I I 

SEQ ID NO: 10033 TGAAOTACCAAGTCAATGGITACCCT 

SEQ ID NO: 10084 TGAATTACCAAGTCAATSTTTACCCTA^ 

Consensus TGAATTACCAAGTC^TGGTTACCCTAATATGTTTATCACCCGCGAAGAAGCTA 
Prim. cons. TGAATTAC(2AAGTCAATGGTTA(^ 

130 140 150 160 170 180 

I I I I I I 

SEQ ID NO: 10033 ACGTTCGTGCGTGGATTGGCTITGATGTAGAGGGCnxn^ 

SEQ ID NO: 10084 ACOTTa3TGCGTGGATTGGaTIX3ATGTAG^ 

Consensus ACGTTCGTGCGTGGATTGGCTTTGATOTAGAGGGCTGTC^^ 

Prim. cons. . ACGTTCGTGCGTCGATTGGCTT^ 



190 200 210 220 230 240 

I III I I 

SEQ ID NO: 10033 GTACTAACCTACCTCITCCAGCTAGGATTTTCTACAGGTGTTAAOT^ 

SEQ ID NO: 10084 GTACTAACCTACCTCTCCAGCTAGGATTTTCTA^ 

Consensus GTACTAACCTACCTCTCCAGCTAGGATTTTCTAC^GGTGTTAACTTAGT 

Prim. cons. GTACTAACCTACCTC^C^GCTAGGATTTTCTACAGGTGTTAACTO 



250 260 270 280 290 300 

I I I I I I 

SEQ ID NO: 10033 CTXSGTTATGTTGACACTGAA^ 

SEQ ID NO: 10084 CTGGOTATGTTGACACTGAA 
Consensus CTCXSTTATGTTGACACTGAAAA^ 

Prim. cons. CTGGvPTATGTTGA(^CTGAAAATAACAC!AGAATTCACCA^ 



310 320 330 340 350 360 

I I I I I I 

SEQ ID NO: 10033 CAGGTGACCAGTrTAAACATCTTATACC 

SEQ ID NO: 10084 CAGGTGACCAGITTAAACATCOT 

Consensus CAGGTGACCAGTTTAAACATCTTATACC 

Prim. cons. (TAGGTGAC(]AGTTTAAA(ZATCTTATAC(^CTCATGTATAMG 



etc. 



WO 2004/092360 



147/193 



PCT/US2004/011710 



FIGURE 122 

5*3' Frame 1 



cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaa 

PRHTQRT-PTVDSSL-WVSK 
tgaattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtc 

-ITKSMVTLICLSPAKKLFV 
acgttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtgg 

TFVRGLALM-RAVMQLEMLW 
gtactaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccga 

VLTYLSS-DFLQVLT--LYR 
ctggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccac 

LVMLTLKITQNSPELMQNLH 
caggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtag 

QVTSLNILYHSCIKACPGM- 
tgcgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcg 

CVLR-YKCSVIH-KDCQTES 
tgttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattg 

C S S F GRMAL S LHQ - S TL S RL 
gacctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcag 

DLK ERVVCVTNVQLAFLLHQ 
atacttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatga 

ILMPAGI ILWVLTMS ITHL-- 
ttgatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgcc 

LMF S SGALRVTFRVTMTNIA 
aggtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcag 

R Y MEMHMWLVVMLS-LDV-Q 
tccatgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatg 

SMSALLSALIGL LNTLL-EM 
aactgagggttaattctgcttgcagaaaagtacaacacatggt£gtgaagtctgcattgc 

N-GL I LLAEKYN TWL - SLHC 
ttgctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgc 

LLI SFQFFMT-EIQRLSSVC 
ctcaggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttaca 

LRLK-NGSSTMLSHVVTKLT 
aaatagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgttt 

K-RNSSILMLYITINSLMVF 
gtttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttg 

VCFGIVTLIVTQPMQLCVGL 
acacaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtga 

TQE SCQT-TYQAVMVVVCM- 
ataagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaat 

ISMHSTLQLSIKVHLLI-SN 
tgcctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcgg 

CLSFTILIVLVSLMANK-CR 
atattgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtg 

ILIMFHSNLLRVLHDAI - VV 
ctgtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatga 

LFADTMQMSTDSTWMHII-- 
tttctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaata 

FLLDLAYGFTNNLILITCGI 
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catttaccaggttacagagttta 
H L P G Y R V 



5' 3' Frame 2 

cctaggcatacccaaaggacatgacctaccgtagactcatctctatgatgggtttcaaaat 

LGIPKGHDLP-THLYDGFQN 
gaattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtca 

ELPSQWLP-YVYHPRRSYSS 
cgttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtggg 

RSCVDWL-CRGLS CN-RCCG 
tactaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccgac 

Y-PTSPARIFYRC-LSSCTD 
tggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccacc 

WLC-H-K-HRIHQS-CKTST 
aggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtagt 

R-PV-TSYTTHV-RLALECS 
gcgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcgt 

AY-DSTNAQ-YTERIVRQSR 
gttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattgg 

VRP L GAWL - AYI NEVLCQDW 
acctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcaga 

T-KNVLSV-QTCNLLFYFIR 
tacttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatgat 

YLCLLESFCGF-LCL-PIYD 
tgatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgcca 

-CS AVGLYG-PSE-P-PTLP 
ggtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcagt 

GTW.KC T C G - L - C YH D - M F S S 
ccatgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatga 

P-VLC-AR-LVC-IPYYRR- 
actgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgct 

TEG-FCLQKSTTHGCEVCIA 
tgctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgcc 

C - - VS S S S - HRKS KGYQVCA 
tcaggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttacaa 

SG-SRMEVLRCSAM- - QSLQ 
aatagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgtttg 

NRGTLLFLCYTSR - IH-WCL 
tttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttga 

F'VLEL-R-SL-PSQCNCV-V- 
cacaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtgaa 

HKSLVKLELTRL-WW-FVCE 
taagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaatt 

-ACIPHSSFR-KCIY-FKAI 
gcctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcgga 

AFLLLF - -SL-VSWQTSSVG 
tattgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtgc 

Y-LCSTQICYVYYTMQFRWC 
tgtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatgat 
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CLQTPCK-VPTVLGCI-YDD 
ttctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaatac 

FCWI-PMDLQTI-YL-PVEY 
atttaccaggttacagagttta 

I Y Q V T E F 



5*3* Frame 3 

cctaggcatacccaaaggacATGacctaccgtagactcatctctatgatgggtttcaaaatg 

-AYPKDMTYRRLI SMMGFKM 
aattaccaagtcaatggttaccctaatatgtttatcacccgcgaagaagctattcgtcac 

NYQVNGYPNMFITREEAIRH 
gttcgtgcgtggattggctttgatgtagagggctgtcatgcaactagagatgctgtgggt 

VRAWIGFDVEGCHATRDAVG 
actaacctacctctccagctaggattttctacaggtgttaacttagtagctgtaccgact 

T NLPLQLGFSTGVNLVAVPT 
ggttatgttgacactgaaaataacacagaattcaccagagttaatgcaaaacctccacca 

GYVD TENNTEFTRVNAKPPP 
ggtgaccagtttaaacatcttataccactcatgtataaaggcttgccctggaatgtagtg 

GDQFKHL I PLMYKGLPWNVV 
cgtattaagatagtacaaatgctcagtgatacactgaaaggattgtcagacagagtcgtg 

RIKIVQML.S'DTLKGLSDRVV 
ttcgtcctttgggcgcatggctttgagcttacatcaatgaagtactttgtcaagattgga 

FVLWAHG FELTSMKYFVKIG 
cctgaaagaacgtgttgtctgtgtgacaaacgtgcaacttgcttttctacttcatcagat 

PERTCC LCDK RA TCFSTSSD 
acttatgcctgctggaatcattctgtgggttttgactatgtctataacccatttatgatt 

TYACWNH SVG- FDYVYNPFMI 
gatgttcagcagtggggctttacgggtaaccttcagagtaaccatgaccaacattgccag 

DVQQWGFTGN LQSNHDQHCQ 
gtacatggaaatgcacatgtggctagttgtgatgctatcatgactagatgtttagcagtc 

VHGNAHVASCDAIMTRCLAV 
catgagtgctttgttaagcgcgttgattggtctgttgaataccctattataggagatgaa 

HECFVKRVDWSV EYPI IGDE 
ctgagggttaattctgcttgcagaaaagtacaacacatggttgtgaagtctgcattgctt 

LRVNSAC RKVQHMVVK SAL L 
gctgataagtttccagttcttcatgacataggaaatccaaaggctatcaagtgtgtgcct 

ADKFPVLHDIGNPKAIKCVP 
caggctgaagtagaatggaagttctacgatgctcagccatgtagtgacaaagcttacaaa 

QAEVEWKFYDAQPC SDKAYK 
atagaggaactcttctattcttatgctatacatcacgataaattcactgatggtgtttgt 

IEELFYSYAIHHDKFTDGVC 
ttgttttggaattgtaacgttgatcgttacccagccaatgcaattgtgtgtaggtttgac 

LFWNCNVDRYPANAI VCRFD 
acaagagtcttgtcaaacttgaacttaccaggctgtgatggtggtagtttgtatgtgaat 

TRVLSNLNLPGCDGGSLYVN 
aagcatgcattccacactccagctttcgataaaagtgcatttactaatttaaagcaattg 

KHAFHTPAFDKSAFTNLKQL 
cctttcttttactattctgatagtccttgtgagtctcatggcaaacaagtagtgtcggat 

PFFYYSDSPCESHGKQVVSD 
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attgattatgttccactcaaatctgctacgtgtattacacgatgcaatttaggtggtgct 

IDYVPLKSATCITRCNLGGA 
gtttgcagacaccatgcaaatgagtaccgacagtacttggatgcatataatatgatgatt 

VCRHHANEYRQYLDAYNMMI 
tctgctggatttagcctatggatttacaaacaatttgatacttataacctgtggaataca 

SAGFSLWIYKQFDTYNLWNT 
tttaccaggttacagagttta 

F T R L Q S L 



3»5" Frame 1 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaat 

-TL-PGKCIPQVISIKLFVN 
ccataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactc 

P-AKSSRNHHIICIQVLSVL 
atttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcaga 

ICM VSANSTT-IASCNTRSR 
tttgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggact 

FEWNI INIRHYLFAMRLTRT 
atcagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagc 

IRIVKERQLL-ISKCTFIES 
tggagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagtt 

WSVECMLIHIQTTTITAW-V 
caagtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatc 

QV-.QDSCVKPTHNCIGWVT I 
aacgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcata 

NVT I P K Q TNT I S E F IVMY S I 
agaatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaa 

RIEEFLYFVSFVTTWLSIVE 
cttccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatg 

LPFYFSLRHTLDSLWISYVM 
aagaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttct 

KNWKLI SKQCRLHNHVLYFS 
gcaagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaac 

AS R.INPQ F I S YNRVFNRP IN 
gcgcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagc 

ALNKAL. MDC-TSSHDSITTS 
cacatgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgt 

HMC I SMYLAMLVMVTLKVTR 
aaagccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacaga 

KAPLLNINHKWVIDIVKTHR 
atgattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcaca 

MIPAGISI--SRKASCTFVT 
cagacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaa 

QTTRSFRSNLDKVLH-CKLK 
gccatgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgag 

AMRPKDEHDSV-QSFQCITE 
catttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtat 

HLYYLNTH YIPGQAFIHEWY 
aagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgtt 
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KMFKLVTWWRFCINSGEFCV 
attttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatcc 

IFSVNITSRYSY-VNTCRKS 
tagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatc 

-LER-VSTHSISSCMTALYI 
aaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattagg 

KANPRTNVTNSFFAGDKHIR 
gtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggt 

VTIDLVIHFETHHRDESTVG 
catgtcctttgggtatgcctagg 

H V L W V C L 



3*5' Frame 2 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaatc 

KLCNLVNVFHRL - VSNCL - I 
cataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactca 

HRLNPAEI IILYASKYCRYS 
tttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcagat 

FAWCLQTAPPKLHRVIHVAD 
ttgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggacta 

L S G T - S I S D T T C L P - D S Q G L 
tcagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagct 

SE - - KKGNCFKLVNALLSKA 
ggagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagttc 

GVWNACLFTYKLPPSQPGKF 
aagtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatca 

KFDKTLVSNLHTIALAG-RS 
acgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcataa 

TLQFQNKQTPSVNLS-CIA- 
gaatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaac 

E-KSSSIL-ALSLHG-AS-N 
ttccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatga 

FHSTSA-GTHLIAFGFPMS- 
agaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttctg 

RTGNLSASNADFTTMCCTFL 
caagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaacg < 

QAELTLSSSPI IGYST DQST 
cgcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagcc 

RLTKHSWTAKHLVMIASQLA 
acatgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgta 

TCAFPCTWQCWSWLL-RLPV 
aagccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacagaa 

KPHC-TSIINGL-T-SKPTE 
tgattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcacac 

-FQQA-VSDEVEKQVARLSH 
agacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaag 

RQHVLSGPILTKYFIDVSSK 
ccatgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgagc 

PCAQRTNTTLSDNPFSVSLS 
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atttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtata 

ICTILIRTTFQGKPLYMSGI 
agatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgtta 

RCLNWS PGGGFALTLVNSVL 
ttttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatcct 

FSVST - PVGTATKLTPVENP 
agctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatca 

SWRGRLVPTASLVA-QPSTS 
aagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattaggg 

KPIHART -RIASSRVINILG 
taaccattgacttgcrtaattcattttgaaacccatcatagagatgagtctacggtaggtc 

-PLTW-FILKPIIEMSLR-V 
atgtcctttgggtatgcctagg 

M S F G Y A - 



3»5" Frame 3 

taaactctgtaacctggtaaatgtattccacaggttataagtatcaaattgtttgtaaatcc 

NSV TW-MYSTGYKYQIVCKS 
ataggctaaatccagcagaaatcatcatattatatgcatccaagtactgtcggtactcat 

IG -I QQKSSYYM HPSTVGTH 
ttgcatggtgtctgcaaacagcaccacctaaattgcatcgtgtaatacacgtagcagatt 

LHGV CKQHHLNC I V - Y T - Q I 
tgagtggaacataatcaatatccgacactacttgtttgccatgagactcacaaggactat 

-VEHNQYPTLLVCHETHKDY 
cagaatagtaaaagaaaggcaattgctttaaattagtaaatgcacttttatcgaaagctg 1 

Q NS KRKA I ALN - - MHFYRKL 
gagtgtggaatgcatgcttattcacatacaaactaccaccatcacagcctggtaagttca 

ECGMHAY SH TNYHHHSLVS S 
agtttgacaagactcttgtgtcaaacctacacacaattgcattggctgggtaacgatcaa 

SLTRLLCQTYTQLHWLGNDQ 
cgttacaattccaaaacaaacaaacaccatcagtgaatttatcgtgatgtatagcataag 

RYNSKTNKHHQ - IYRDV - HK 
aatagaagagttcctctattttgtaagctttgtcactacatggctgagcatcgtagaact 

NRRVPLFCKLCHYMAEHRRT 
tccattctacttcagcctgaggcacacacttgatagcctttggatttcctatgtcatgaa 

SILLQPEAHT--PLDFLCHE 
gaactggaaacttatcagcaagcaatgcagacttcacaaccatgtgttgtacttttctgc 

ELETYQQAMQTSQPCVVLFC 
aagcagaattaaccctcagttcatctcctataatagggtattcaacagaccaatcaacgc 

KQN-PS VHLL - - GIQQTNQR 
gcttaacaaagcactcatggactgctaaacatctagtcatgatagcatcacaactagcca 

A-QSTHGLLNI-S--HHN-P 
catgtgcatttccatgtacctggcaatgttggtcatggttactctgaaggttacccgtaa 

HVHFHVPGNVGHGYSEGYP 
agccccactgctgaacatcaatcataaatgggttatagacatagtcaaaacccacagaat 

SPTAEHQS-MGYRHSQNPQN 
gattccagcaggcataagtatctgatgaagtagaaaagcaagttgcacgtttgtcacaca 

DSSRHKYLMK-KSKLHVCHT 
gacaacacgttctttcaggtccaatcttgacaaagtacttcattgatgtaagctcaaagc 
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DNTFFQVQS - QSTSLM-AQS 
catgcgcccaaaggacgaacacgactctgtctgacaatcctttcagtgtatcactgagca 

HAPKGRTRLCLTILSVYH-A 
tttgtactatcttaatacgcactacattccagggcaagcctttatacatgagtggtataa 

FVLS-YALHSRASLYT-VV- 
gatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgtgttat 

DV-TGHLVEVLH-LW - ILCY 
tttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaatccta 

FQCQHNQSVQLLS-HL-KIL 
gctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctacatcaa 

AGEVG-YPQHL - LHDSPLHQ 
agccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatattagggt 

SQSTHERDE-LLRG--TY-G 
aaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggtaggtca 

NH-LGNSF-NPS-R-VYGRS 
tgtcctttgggtatgcctagg 

C P L G M P R 
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FIGURE 123 



(X)TAGGCATA(X£AAAGGACATGACCT^^ 

i N..N i M.... 

TACXX^MTATGTTTATCACCCGCGMGAAGC^ 

i N N 

TGCMCTAGAGATGCTCTGGGTACT^ 

N 

CTGGTTATGTTGACACTGAAAATMCACA^ 

N N 

CTTATACCACTCAT6TATAAAGGCTTGCX!CTGGAATCTAff 

N N N 

AGGATTGTCAGACAGAGTCGTGTTC^^ 

N N 

GACCTGAAAGMCGTGTTCTCTGTGTGA(^^ 

N 

CATTCTGTGGGTTTTGACTATGOT 

N N N 

TMCCATGAO^CATTGCCAGGTAC^^ 

N N N N N i N 

TCCATGAGTGCTTTGTTMGCG(X3T^ 

...N N 

TGCAGAAAACTA(^CACATGG^^ 

i N. 

AAAGGCTATCAAGTGTGTGCCT(^GGCTGAAGTA 

N N N 

AAATAGAGGAACTCTTCTATTCTTATGOT 

N N 

GTTGATCGTTACCCAGCCMTGCMTTGTGTGTAGGTT^ 

N ....N 

TGGTGGTAGTTTGTATGTCAATMGCATGCATTCCAC^ 

N N 

TGCCTTTCTTTTACTATTCTC^ 

i N 

AMTCIGCTACGTGTATTACACGATG^ 

N N N 

GGATGCATATMTATGATGATTOTGCTGGA 

..N N..N N 

CATTTACCAGGTTACAGAGTTTA SEQ ID NO: 10084 
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FIGURE 123 (contd.) 
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FIGURE 124 



Sequences : 
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gi 
gi 
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gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 



74827 |pir | |VFIHJH genome polyprotein lb - murine hepatit. 
14917044 sp|P29982|RRPB_CVttJH RNA-directed RNA polymeras . 
26007546 ref |NP_068668.2| ORFlab polyprotein [Murine hep. 
7769342 gb AAF69332.1 AF208066_2 RNA-directed RNA polyme . 
6625761 gb AAF19384.1 AF201929_2 RNA-directed RNA polyme. 
2641128 gb AAB86818.1 RNA-directed RNA polymerase [muri. 
4377413 emb|CAA36202.l| open reading frame lb (AA 1-2733. 
133592 | sp | P16342 | RRPB_CVMA5 RNA-DIRECTED RNA POLYMERASE 



26008080 
15077820 
18033972 



ref |NP_150073.2 
gb|AAK83365.1 
gb|AAL57305.1 



or flab polyprotein [Bovine cor. 
replicase [bovine coronavirus] 
replicase [bovine coronavirus] 
7769353|gb|AAF69342.l|AP208067_2 RNA-directed RNA polyme. 
17529672' 
25121571 
26008092 
10242469 
14149033 
458735 
133594 
29293454 
25121555 



gb|AAL40397.1|AP220295_2 RNA polymerase lb [bov. 
ref NP_740618.1 coronavirus nspll [Murine hepa. 
ref NP_742140.1 coronavirus nspll [Bovine coro. 
ref NP_066134.1 ORPlab polyprotein; frameshift. 
emb CAC39112.l| replicase polyprotein lab [Avia. 
emb|CAA83018.l| potential chimeric protein [Avian, 
sp | P2 6314 | RRPB_IBVB RNA-DIRECTED RNA POLYMERASE ( . 



gb|AAO67706.1| 
ref |NP_740631.1| 
9635157 | ref |NP_058422 . 1 | 



ORFlb polyprotein [Avian infecti. 
coronavirus nspll [Avian infec. 
replicase [Transmissible gastro. 
19387582 ref NP_598309.1 Poll [porcine epidemic diarrhe. 
12175747 ref NP_073549.1 replicase polyprotein lab [Hum. 
133591 1 sp | P18458 1 RRPB_BEV RNA-directed RNA polymerase (0 . 
1513061 | dbj | BAA13323 . 1 | cyanoprotein alpha subunit precu . 
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Alignments 



>gi|74827|pir| |VFIHJH 
(strain JHM) 

Length = 2731 



genome polyprotein lb - murine hepatitis virus 



Score = 638 bits (1645), Expect = 0.0 

Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISmGFKMNYQWG 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1585 VTYSRLI SLMGFKLDLTLIX^ IGTNFPLQ 1644 

Query: 66 LGFSTGVNLVAVPTGYVDTE^ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 
Sbjct: 1645 I/3FSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKH^ 1704 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELT 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1705 MLSDHLVDLADSVVLVTWAASFELTCLRYFAKVGKEWCSV 1764 



Query: 186 HSVGFDYVYNPFMIDVQQWGFTGN^ 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1765 HSYSODYLYNPLIVDIQQWGOTGSLTSNHDPICSVHKGAHVAS^ 1824 
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Query: 246 RVDWSVEYPIIGDEIJIWSA 305 

V+W++BYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1825 SVNWNLEYPI I SNEVSVNTSCRLLQRVMFRAAMLCNRYDV -YDF 1882 

Query: 306 KFYBAQPCSDKAYKIEELFYSYAIHHD^^ 365 

KFYDA P +++ Y Y H D+P IXHC+FWNCNVIHYPANA+VOTFiyrRVLS 

Sbjct: 1883 KFYDASPV---VKSVKQFVYKYEM1KDQFL^ 1939 

Query: 366 LNLPGCDGK3SLYVNIQIAFHTPAFDKSAFTNLK 425 

LNLPGC+GGSLYVNKHAFHT P ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1940 LNLPGCNGGSLYVNKHAFHTNPFTRAAFENLK^ 1999 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQY 485 

+SATC ITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLYJNTFTRLQS 
Sbjct: 2000 RSATCITRCNLGGAVCLIQIAEEYREYLE 2059 

Query: 486 L 486 
L 

Sbjct: 2060 L 2060 



>gi 
gi 



14917044 1 sp|P29982 | RRPB_CVMJH RNA-directed RNA polymerase (ORFlB) 
758332l|gb|AAA46458.2| open reading frame lb [murine hepatitis virus] 
Length = 2731 



Score = 637 bits (1644), Expect = 0.0 

Identities = 287/481 (59%), Positives =366/481 (76%), Gaps = 5/481(1%) 

MTYRRLISMMGFKMNYQ 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HATRD++GTN PLQ 



LGFSTGVNLVAVPTGWVDTENNTEFTR^ 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 



MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCL^ 185 
MLSD L L+D W V WA FELT ++YF K+G E C +C+KRATCF++ + Y CW 



HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAH^ 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 



RVDWSVEYPIIGDELRWSACRKVQHMWKSAL 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 



KFYDAQPCSDKAYKIEELF^SYMHHDKFTIXSVCLFWNC^ 365 
KFYDA P +++ Y Y H D+F IX3+C+FWNCNVD+YPANA+VCRFDTRVLS 



LNLPGCIX5GSLYVNKHAFHTPAFDKSAFTNLKQLPF 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 



Query: 


6 


Sbjct: 


1585 


Query: 


66 


Sbjct: 


1645 


Query: 


126 


Sbjct: 


1705 


Query: 


186 


Sbjct: 


1765 


Query: 


246 


Sbjct: 


1825 


Query: 


306 


Sbjct: 


1883 


Query: 


366 


Sbjct: 


1940 



WO 2004/092360 



158/193 



PCT/US2004/011710 



Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLnA 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YKFD YNLWNTFTRLQS 
Sbjct: 2000 RSATCITRCNLGGAVCLKHAEEYREYLESYOT 2059 

Query: 486 L 486 
L 

Sbjct: 2060 L 2060 



>gi 1 26007546 1 ref |NP_068668.2| ORPlab polyprotein [Murine hepatitis virus] 
Length = 7178 

Score = 637 bits (1644), Expect = 0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQWGYPNMFI 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 6032 VTYSRLISLMGFKLDLTL^^ 6091 

Query: 66 LGFSTGVNLVAVPTGT^VDTENOT 125 

LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 
Sbjct: 6092 LGFSTGIDFVVEATGMFAERIX5YVFKKAAARAPPGEQFKHLI PLMSRGQKWDWRIRIVQ 6151 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCC 185 

MLSD L L+D WVWA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 6152 MLSDHLADLADSWLVTWAASFELTC 6211 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVA 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 6212 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPIC 6271 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQro 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 6272 SVNWNLEYPIISNEVSVNTSCRL^ 6329 

Query: 306 KFYDAQPCSDKAYKIEELFYSYMHHDKFTDG^ 365 

KFYDA P +++ Y Y H D+F IX3+C+FVmCNVD+YPANAWCRJ^^ 

Sbjct: 6330 KFYDASPV VKSVKQFVYKYEAHKDQFLIX5LCMFWNC^ 6386 

Query: 366 LNLPGCDGGSLYVNIOIAFHTPAFD^ 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 6387 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENI^^ 6446 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMMI SAGFSLWIYKQFDTYNLWOTTTRLQS 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 6447 RSATCITRCNLGGAVCLKHAEEYREYLESYOT 6506 

Query: 486 L 486 
L 

Sbjct: 6507 L 6507 

>gi|7769342|gb|AAF69332.1|AF208066_2 RNA-directed RNA polymerase [murine 
hepatitis virus] 

Length = 2732 
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Score = 637 bits (1644), Expect = 0.0 

Identities = 287/481 (59%), Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYIUILISMMGFKMNYQWGYF 65 

+TY RLIS+MGFK++ ++GY +FITR+EAIR VRAW+GFD EG HATRDf+GTN PLQ 
Sbjct: 1586 VTYSRLISLMGFKLDLTL^ 1645 

Query: 66 LGFSTGVl^VAVPTGYVDTE 125 

LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 
Sbjct: 1646 LGFSTGIDFVVEATGMFAERIXSYVFKKAVARAPrc 1705 

Query: 126 MLSDTLKGLSDRVVFVLWAH^ 185 

MLSD L L+D W VM FELT ++YF K+G E C +C+KRATCF++ + Y CW 
Sbjct: 1706 MLSDHLVDLADSVVLVTWAAS 1765 

Query: 186 HSVGFDYVYNPFMIWQQWGFTGNLQSNHDQHCQV^ 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCTiAVH+CF K 
Sbjct: 1766 HSYSCTYLYNPLIVDIQQWG^TCSLTSNHDLICSVHKGAH^ 1825 

Query: 246. RVDWSVEYPIIGDKLRVNSACRKVQHMVVKSALI^ '305 

V+WS+EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1826 SVNWSLEYPIISNEVSVNTSCR^RVMFR^ 1883 

Query: 306 KFYDAQPCSDKAYIQEELFYSYMHHDKF^ 365 . 

KFYDA P +++ Y Y H D+F IX5+C+FWNCNVD+YPANA+VCRFDTRVL+ 

Sbjct: 1884 KFYDASPV VKSVKQFVYKYEAHKDQFLIX5LCMFWNCNVDKYPA 1940 

Query: 366 LNLPGOX^SLYVNKHAFOT^ 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC • +DYVPL 

Sbjct: 1941 LNLPGCNGGSLYVNKHAFOTSPFTRAAFENLKPMPFFY^ 2000 

Query: 426 KSATCITOQJLGGAVOUfflANEYRQYLDAY 485 

+SATCITRCNLGGAVC HA +YR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2001 RSATCITRCNLGGAVGjKHAEDY^ 2060 

Query: 486 L 486 
L 

Sbjct: 2061 L 2061 . 



>gi| 6625761 jgb|AAF19384.l|AF201929_2 RNA-directed RNA polymerase [murine 
hepatitis virus strain 2] 

gi|7739595|gb|AAF68920.l|AF207902_2 RNA-directed RNA polymerase [murine 
hepatitis virus strain ML-11] 
Length = 2733 

Score = 637 bits (1643), Expect = 0.0 

Identities = 287/481 (59%) , Positives = 366/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQWGYFNMFITREEAIRHVRAWI 65 

+TY RLIS+MGFK++ ++GY +FITR+EAIR VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLIX5YCKLFITRDEAIR^ 1646 

Query: 66 LGFSTGPVNLVAVPTGYVDTENNTE 125 
LGFSTG++ V TG + F + A+ PPG+QFKHL+PLM +G W+WRI+IVQ 
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Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGYVFKKAVARAP 1706 

MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPER 185 
MLSD L L+D WVWA FELT ++YF K+G E C +C+KRATCF++ + Y CW 



HSVGFDYVYNPFMIDVQQWGFTGNLQSNHIXJHCQVHGNAHVASCT 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 



RVDWSVEYPIIGDELRWSACnOCVQHMVVKSAI^^ 305 
V+WS+EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 



KFYDAQPCSDKAYKIEELFYSYAIHHDKFT1X3VQ 365 
KFYDA P +++ Y Y H D+F DG+C+FWNCNVIH-YPANA+VCRFDTRVL+ 



LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCE 425 
LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSOT 2001 

KSATCITRCNLGGAVCRHHANEYRQYi^ 485 
+SATCITRCNLGGAVC HA +YR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 





1647 




126 


D u J tt. 


1707 


yuery . 




CTK~i /-if* • 


1 7fi7 


Query: 




oDJCC . 


1 Q97 


Query: 




PVi-I /if- . 

OUJ Cu . 


looD 


yuery . 


jOD 




1 QA9 


Query: 


42.6 


Sbjct: 


2002 


Query: 


486 


Sbjct: 


2062 



>gi|2641128|gb|AAB86818.l| RNA-directed RNA polymerase [murine hepatitis 
virus] 

Length = 2733 
Score = 635 bits (1637), Expect = 0.0 

Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

MTYRRLISMMGFKMNYQWGYPNMFITREEMRHVRA 65 
+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 



LGFSTGVNLVAVPTGYVDTENNTEFTO 125 
LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 



MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIG 185 
MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ + Y CW 



HSVGFDYVYNPFMIDVQQWGFTGNLQ^ 245 
HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 



Query: 


6 


Sbjct: 


1587 


Query: 


66 


Sbjct: 


1647 


Query: 


126 


Sbjct: 


1707 


Query: 


186 


Sbjct: 


1767 



Query: 246 RVDWSVEYTIIGDELRWSACRKVQHMVVKSALI^ 305 
V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
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Sbjct: 1827 SVNWNLEYPIISNEVSVOTS 1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTDG^^ 365 

KFYDA P +++ YY HDfF IXHC+FWNOWD+YPA^ 

Sbjct: 1885 KFYDASFV VKSVKQFVYKYEAHKDQFLIXSL^^ 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFT^ 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGOTCGSLYVNKHAFHTSPFTR^^ 2001 

Query: 426 KSATCITRCNLGC^VCRHHANEYRQYLDAYNMMISAGFSLW 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCNLGGAVCLiKHAEEYREYLESYOT 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



>gi 1 4377413 |emb|CAA3 6202.1) open reading frame lb (AA 1-2733) [Murine 
hepatitis virus] 

Length = 2733 

Score = 634 bits (163.6), Expect = 0.0 
-Identities = 286/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRELISMMGFKMNYQVNGYTNMFITRE^ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VTYSRLISLMGFKLDLTLDGYOO^ITRD 1646 

Query: 66 LGFSTGVl^VAVPTGYVDTENNTEFTO 125 

. LGFSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 
Sbjct: 1647 LGFSTGIDFVVEATGMFAERDGYVFKKAAARAPPGEQFKHL 1706 

Query: 126 MLSOTLKGLSDRVVFVLWAHGFELTS 185 

MLSD L L+D W V WA FELT ++YF K+G E C +C KRATCF++ f Y CW 
Sbjct: 1707 MLSDHLVDLADSVVLVTWAASFE 1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQ^^ 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCTYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSD^ 1826 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQH 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVNWNLEYPIISNEVSVOTSC^ 1884 

Query: 306 KFYDAQPCSDKAYKIEBLFYSYMHHDKFTDGV^^ 365 

KFYDA P +++ Y Y H D+F DG+C+FWNOWIHYPANA+Va*^^ 

Sbjct: 1885 KFYDASFV-— VKSVKQFVYKYEAHKDQF^^ 1941 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNM 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTR 2001 

Query: 426 KSATCITRCOTiGGAVCEHHANEYRQYL^^ 485 
+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
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Sbjct: 2002 RSATCITRCNLGGAVCLKHAKBYRBYLESYNTATO 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 



gi 

A59 



133592 | sp 
93916 |pir 



P16342|RRPB_CVMA5 RNA-DIRECTED RNA POLYMERASE (ORFlB) 

| S15760 genome polyprotein - murine hepatitis virus (strain 



Length = 2733 



Score = 634 bits (1636) , Expect = 0.0 

Identities = 286/481 (59%) , Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 .MTY^ISmGFKMNYQWG^ 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA REH+GTN PLQ 
Sbjct: 1587 VTYSRLISIMGFKLDLTIJXjYCKLFITRDEAIK^ 1646 

Query: 66 LGFSTGVNLVAVPTGYVOTENNTEFTO 125 

LGPSTG++ V TG + F + A+ PPG+QFKHLIPLM +G W+WRI+IVQ 
Sbjct: 1647 LGFSTGIDFVVEATGMFAEMX5YVFKKAAARAPPGEQFKH^ 1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELT 185 

MLSD L L+D WVWA FELT .++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MLSDHLVDLADSVVLVTWAASFELTCL^ 1766 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHC^ 245 

.HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSNHDPICSVHKGAHVASSDAIMra 1826 

Query: 246 RVDWSVEYPIIGDELRWSACR^ 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVNWNLEYPIISNEVSVNTSO*^ 1884 

Query: 306 KFYDAQPCSDKAYIQEELFYSYMHHDKFTIX^ 365 

KFYDA P +++ Y Y H D+F IX3+C+FWNCNVIHYPANA+VCRFDTRVL+ 

Sbjct: 1885 KFYBASPV VKSVKQFVYKYEAHKDQFLIX3LCM^ 1941 

Query: 366 LNLPGCIX^SLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDS 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENI^^ 2001 

Query: 426 KATCITRCNLGGAVCRHHANEYRQY 485 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCNLGGAVCLKHAEEYREYLESYOT 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 

>gi | 26008080 |ref |NP_150073.2| orf lab polyprotein [Bovine coronavirus] 
Length = 7094 



Score = 633 bits (1633), Expect = e-180 
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Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYW^ISmGFKMN^ 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLDVT^ 6007 

Query: 66 IX5FSTGVNLVAVPTGYVDTE^EFTO 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 
Sbjct: 6008 LGFSTGIDFVVEATGLFADRI^ 6067 

Query: 126 MLSDTLKGLSDRWFVLWMGFELTSMKYFVKIGPERTC 185 

M+DL LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSDC^A/LVTWAANFELTCI^ 6127 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQH(^V^ 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 6128 HSVTCDYLYNPLIVDIQQWGYIGSLSS 6187 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQHMVVKSALLADK^ 305 

++W+VEYPII +EL +N++CR +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 6188 NINWNVEYPIISNELSIOTSCRVLQRVMLKAA — KDFDF 6245 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFTOGVO^ 365 

KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sbjct: 6246 KFYDAQPI---VKSVKTLLYSFEA 6302 

Query: 366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNIJCQL 425 

- LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 6303 LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHL 6362 

Query: 426 KSATCITRCNIX5GAVCRHHANEYRQYLDAYNMMI SAGFSLWI YKQFDTYNLWNTFTRLQS 485 

> KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 6363 KSATCITRCNLGGAVCIiKHAEEYREY 6422 

Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gi|l5077820|gb|AAK83365.l| replicase [bovine coronavirus] 
Length = 7094 

Score = 633 bits (1633), Expect = e-180 

Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 (1%) 

Query: 6 MTYRRLISMMGFKMNYQWGYPNMFITREEAIRHVRAWIGFD 65 

+TY RLIS+MGFK++ ++GY +FIT+BEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLDWLDGYCKI^^ 6007 

Query: 66 LGFSTGVNLVAVPTGf¥VDra 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 
Sbjct: 6008 LGFSTGIDFVVEATGLFADRIXJYSFKKAVAKAPPGEQFKHLIP 6067 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVIQGPERTCCLCDKRATC 185 

M+DL LSD W V WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSEK^VVLVTWAANFELTCLRYFA 6127 
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186 HSVGFDYVYNPFMIWQQWGFTGNI^SNHDQHCQVEG 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
6128 HSVTCDYLYNPLIVDIQQWGYIGSLS SNHDLYCSVHKGAHVASSDAIM11CLAVYDCFCN 6187 

246 RVDWSVEYPIIGDBLRWSACRKVQHMV^ 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
6188 NINVmVEYPIISNELSiOJTSCRV^ 6245 

306 KFYDAQPCSDKAYIQEELFYSYMHHDKFTDGV 365 

KFYDAQP ++ L YS+ H D F DG+C+FWNCNVEH-YP NA+VCRFDTRVL+N 

6246 KPYDAQPI VKSVKTLLYSFEAHKDSFKDGLCMFWNCN^ 6302 

366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFT^ 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
6303 LNLPGCNGGSLYVNKHAFHTKPFSRAAF^ 6362 

426 KSATCITRCNLGGAVCRHHANEYRQYLDA 485 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
6363 KSATCITRCNLGGAVCLKHAEBYRBYLESYNTATTAGF 6422 

486 L 486 
L 

6423 L 6423 



>gi 1 18033972 |gb|AAL57305.1 1 • replicase [bovine coronavirus] 
Length = 7094 

Score = 633 bits (1633), Expect = e-180 

Identities = 284/481 (59%), Positives = 367/481 (76%), Gaps = 5/481 Cl%) 

Query: 6 - MTYRRLI S1MGFKMNYQWGYPNMFITREEMRHVRAWIGFDVEGCHATO 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 5948 VTYSRLISLMGFKLEWLIX^ 6007 

Query: 66 LGFSTGVNLVAVPTGYVOTENNTEFTRVNAK^ 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 
Sbjct: 6008 LGFSTGIDFVVEATGLFADMX3YSFKKAVAKAPPGEQFKHLI 6067 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTC 185 

M+DL LSD WV WA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 6068 MFADHLIDLSDCVVLVTWAANFE 6127 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNH^ 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAB5TRCLAV++CF 
Sbjct: 6128 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVAS 6187 

Query: 246 RVDWSVEYPIIGDELRVNSACRKVQHMV^ 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 6188 NINWNVEYPIISNELSINTSCRVLQRVMLKAA 6245 

Query: 306 KFYDAQPCSDKAYKIEELFYSYMHHDKFTEK5VCLFWN 365 

KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sbjct: 6246 KFYDAQPI VKSVKTLLYSFEAHKDSFKDGLCMEWCNTO 6302 



Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
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Query: 366 LNLPGOX^SLYVNKHAF^ 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 6303 LNLPGCNGGSLYVNKHAFHTK^ 6362 

Query: 426 KSATCITRCNLGGAVC^HHANEYRQYLDAY 485 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 6363 KSATCITRCNLGGAVC^ 6422 

Query: 486 L 486 
L 

Sbjct: 6423 L 6423 



>gi 1 7769353 |gb|AAF69342.l|AF208067_2 RNA-directed RNA polymerase [murine 
hepatitis virus] 

Length = 2733 

Score = 633 bits (1633) , Expect = e-180 

Identities = 285/481 (59%), Positives = 364/481 (75%), Gaps = 5/481 (1%) 

Query: 6 MTYRMiISMMGFKMNYQVNGYPNMFITREE^ 65 

++Y RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 1587 VSYSRLISLMGFKLDLTLIX^ 1646 

Query: 66 LGFSTGVNLVAVFTGYVDTENN^ 125 

LGFSTG++ V TG + F + A+ PPG+QFKHLI^LM +G W+WRI+IVQ 
Sbjct: 1647 IX3FSTGIDFVVEATGMFAERIXJ5fVFKKAAARAP 1706 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGP 185 

MLSD L L+D WVWA FELT ++YF K+G E C +C KRATCF++ + Y CW 
Sbjct: 1707 MiSDHLVDLADSVVLVTWAASFELTCLRYF 1766 

Query: 186 HSVGFDYVYNPFMIWQQWGFTGNLQSNHDQHCQVHGNAH^ 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
Sbjct: 1767 HSYSCDYLYNPLIVDIQQWGYTGSLTSiraDPICSVHKGAHVAS 1826 

Query: 246 RVBWSVEYPIIGDEIJWNSACRKV^^ 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
Sbjct: 1827 SVNWNLEYPIISNEVSVOTSO^ 1884 

Query: 306 KFYDAQPCSDKAYKIEELFYSYAIHHDKF^ 365 

KFYDA P +++ Y Y H D+F DG+C+FWNCNVIHYPANAWCRFiyniVL+ 

Sbjct: 1885 KFYDASPV VKSVKQF\fiflRYEAHKDQFLTO^ 1941 

Query: 366 LNLPGCIX^SLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSD 425 

LNLPGC+GGSLYVNKHAFHT F ++AF NLK +PFFYYSD+PC +DYVPL 
Sbjct: 1942 LNLPGCNGGSLYVNKHAFHTSPFTRAAFENLKPMPFFYYSOT 2001 

Query: 426 KSATCITRCNLGGAVGRHHANEYR^ 485 

+SATCITRCNLGGAVC HA EYR+YL++YN , +AGF+ W+YK FD YNLWNTFTRLQS 
Sbjct: 2002 RSATCITRCNLGGAVCLKHAEEYREYLESYOT 2061 

Query: 486 L 486 
L 

Sbjct: 2062 L 2062 
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>gi| 17529672 |gb|AAL40397.l|AF220295_2 RNA polymerase lb [bovine 
coronavirus] 

Length = 2685 

Score = 623 bits (1607), Expect = e-177 

Identities = 282/481 (58%), Positives = 365/481 (75%), Gaps = 5/481 (1%) 

Query: 6 imTRRLISMMGFKMNYQWGYPNMFira 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 1574 VTYSMjISLMGFKLDVTLIX?^ 1633 

Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTR^ 125 

LGPSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 
Sbjct: 1634 LGFSTGIDFWEATGLFADRIX^ 1693 

Query: 126 MLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKI^ 185 

M +D L LSD W V WA FELT ++YF K+G E +C + KRAT +++ + Y CW 
Sbjct: 1694 MFADHLIDLS1XZVVLVTWAANFELTC 1753 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQTO 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 1754 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVASSD 1813 

Query: 246 RVDWSVEYPI IGDELRWSACElKVQHMVVKSALIiADK^ 305 . 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 1814 NIMtfNVEYPIISNELSINTSCR^ -KDFDF 1871 

Query: 306 KFYDAQPCSDKAYIQiEELFrSYMHHDKFTDG^ 365' 

KFYDAQP ++ L Y + H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 

Sbjct:. 1872' KFYDAQPI VKSVKTLLYFFEAHKDSFKDGLCMFWNCNVDKYPPN^ 1928 

Query: 366 LNLPGCIX3GSLYVNKHAFHT 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 1929, LNLPGCNGGSLYVNKHAFHTKPFSRAAFEHLK^ 1988 

Query: 426 KSATCITRCNLGGAVCRHHANEYRQYIJ)AYNMMI SAGFSLWIYKQFDTYIOINTFTRLQS 485 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQS 
Sbjct: 1989 KSATCITRCNLGGAVCLKHAEEYREYLES 2048 

Query: 486 L 486 
L 

Sbjct: 2049 L 2049 



>gi | 25121571 |ref|NP_740618.l| coronavirus nspll [Murine hepatitis virus] 
Length = 521 

Score = 622 bits (1603), Expect = e-177 

Identities = 284/479 (59%), Positives = 362/479 (75%), Gaps = 5/479 (1%) 

Query: 6 MTYRRLISMMGFKMNYQWGYPNMFITREEMRHVRATO 65 

+TY RLIS+MGFK++ ++GY +FITR+EAI+ VRAW+GFD EG HA RD++GTN PLQ 
Sbjct: 48 VTYSRLISLMGFKLDLTLDCT 107 



Query: 66 LGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPP 125 
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LGFSTG++ V TG + F + A+ PPG+QPKHLIPLM +G W+WRI+IVQ 
108 LGFSTGIDFVVEATGMFAEWXJ^ 167 

126 MLSDTLKGLSDRVVFVLWAHGFKLTSMKYFVKIGPKRTC 185 

MLSD L L+D WVWA FELT ++YF K+G E C +C KRATCF++ + Y CW 
168 MLSDHLADIADSVVLVTWAASFELTCLRYFAKV 227 

186 HSVGFDYVYOTFMIWQQWGFTO 245 

HS DY+YNP ++D+QQWG+TG+L SNHD C VH AHVAS DAIMTRCLAVH+CF K 
228 HSYSmYLYNPLl^IQQWGYTGSLTSNHDPICSVH^ 287 

246 RVDWSVEYPIIGDELRVNSACRKVQHMW 305 

V+W++EYPII +E+ VN++CR +Q ++ ++A+L +++ V +DIGNPK + CV ++ 
288 SVNWNLEYPIISNEVSVOT^ 345 

306 KFYDAQPCSDKAYKIEELFYSYAIHHDKFnX^LFW^ 365 

KFYDA P +++ Y Y H D+F rc+C+FWNCNVmYPANA+VO^TRVIrf 

346 KFYDASPV VKSVKQFVYKYEAHKDQFLDGLCMFWNC^ 402 

366 LNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLP 425 

LNLPGC+GGSLYVNKHAFHT F++AF NLK +PFFYYSD+PC +DYVPL 
403 LNLPGCNGGSLYVNKHAFHTSPFTRAAFE 462. 

426 KSATCITRCNLGGAVCRHHANEYRQYLDAYNMM 484 

+SATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFTRLQ 
463 RSATCITRCNLGGAVCLIOIAEEYREYLESYNTATTA 521 



>gi | 26008092 | ref |NP_742140.l| coronavirus nspll [Bovine coronavirus] 
Length = 521 

Score = 617 bits (1590), Expect = e-175 

Identities = 282/479 (58%) , Positives = 365/479 (76%), Gaps = 5/479 (1%) 

Query: 6 MTYRIUJIS^GFKMNYQVNGYFNMFI 65 

+TY RLIS+MGFK++ ++GY +FIT+EEA++ VRAW+GFD EG HATRD++GTN PLQ 
Sbjct: 48 VTYSRLISIMGFKLDVTLDGYCKLFIT^ 107 

Query: 66 LGFSTGVNLVAVPTGYVDTENOTEFTRVNAKP 125 

LGFSTG++ V TG + F + AK PPG+QFKHLIPLM +G W+WR +IVQ 
Sbjct: 108 IXSFSTGIDFVmTGLFADRIX^SFKKAVAKA^ 167 

Query: 126 MLSOTLKGLSDRVVFVLWAHGFELTSM 185 

M +D L LSD WVWA FELT ++YF K+G E +C +C KRAT +++ + Y CW 
Sbjct: 168 MFADHLIDLSIXJVVLVTWAZOT 227 

Query: 186 HSVGFDYVYNPFMIDVQQWGFTGNLQS^ 245 

HSV DY+YNP ++D+QQWG+ G+L SNHD +C VH AHVAS DAIMTRCLAV++CF 
Sbjct: 228 HSVTCDYLYNPLIVDIQQWGYIGSLSSNHDLYCSVHKGAHVAS 287 

Query: 246 RVDWSVEYPIIGDELRWSACRKVQHMVV^ 305 

++W+VEYPII +EL +N++CR +Q +++K+A+L +++ + +DIGNPKAI CV + ++ 
Sbjct: 288 NINWNVEYPI I SNELSIOTSCRVLQRVMLKAA -KDFDF 345 

Query: 306 KFYDAQPCSDKAYOEELFYSYAIHHDKFTDGVCI^ 365 
KFYDAQP ++ L YS+ H D F DG+C+FWNCNVD+YP NA+VCRFDTRVL+N 



Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct: 
Query: 
Sbjct : 
Query: 
Sbjct: 



WO 2004/092360 



168/193 



PCT/DS2004/011710 



Sbjct: 346 KFYDAQPI---VKSVKTIiYSFEAHKDSFIO^ 402 

Query: 366 LNLPCXIDGGSLYVNK PCESHGKQWSDIDYVPL 425 

LNLPGC+GGSLYVNKHAFHT F ++AF +LK +PFFYYSD+PC +DYVPL 
Sbjct: 403 LNLPGCNGGSLYVNKHAFHTKPFSRA^ 462 

Query: 426 KSATCITRCNLGGA^^ 484 

KSATCITRCNLGGAVC HA EYR+YL++YN +AGF+ W+YK FD YNLWNTFT+LQ 
Sbjct: 463 KSATCITRCNLGGAVCXKHAEBYREYLESYNTATTAGF^ 521 



>gi 1 10242469 1 ref |NP_066134.l| ORFlab polyprotein; frameshift product [Avian 
infectious bronchitis 
virus] 
Length = 6629 

Score = 575 bits (1482) , Expect = e-163 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQWGYFNMFITREE^ 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 5515 EiTYKHLISLLGFKMSVNVEGCHNMFITRDEAIRNTO 5574 

Query: 65 QLGFSTGVNLVAVFTGYVDTENN^ 124 

Q+GFSTG + V P G VDT F VN+K. PPG+QF HL li FW+V+R +IV 
Sbjct: 5575 QVGFSTGADFVVTPEGLVDTSIGNNFEPVNSKAPPGEQFNHLR 5634 

Query: 125 QMLSOTLKGLSDRWFVLWAHGFE 184 

QML+D L +SD WFV W HG BLT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 5635 QM^NLCWSIX^A/FVTWCHGI^ 5693 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQ 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 5694 KHCLGFDFVYNPLLVDIQQWGYSG^ 5753 

Query: 245 KRVOTSVEYPIIGDELRVNSACRK^ 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 5754 QDVNWDLTYPHIANEDEVNSSCRYLQRM^ 5813 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYMHHDKFTIX^ 364 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 5814 FRFYDKNPIVRNVKQFE YDYNQHKDKFAIX3LCM^^ 5870 

Query: 365 NLNLPGCDGGSLYVNITOAFOT 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 5871 VFNLPGCNGGSLYVNKHAFYTPKFDRISF^ 5929 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDAYNM^ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 5930 LATKDCITKC^GGA^ 5989 

Query: 485 SL 486 
S+ 

Sbjct: 5990 SI 5991 
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>gi 1 14149033 |emb|CAC39112.1| replicase polyprotein lab [Avian infectious 
bronchitis virus (strain 
Beaudette CK)] 
Length = 6629 

Score = 575 bits (1482), Expect = e-163 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQWGYFNMFITREEMR 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GPDVE HA +GTNLP 
Sbjct: 5515 EITYKHLISLLGFKMSVNVEGOJ^ 5574 

Query: 65 QLGFSTGVNLVAVPTGYVOTENNT^ 124 

Q+GFSTG + V P G VDT P VN+K PPG+QF HL L PW+V+R +IV 
Sbjct: 5575 QVGFSTGADFVVTPEGLVOTSIGNNFEPVNSK 5634 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCI^ 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 5635 QMIJtf)NLCOTSIXM7FV^ 5693 

Query: 185 NHSVGFDYVYOTFMIDVQQ^^ 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 5694 KHCLGFDFVYOTLLVDIQQWGY^ 5753 

Query: 245 KRVOTSVEYPIIGDELRWSACRKVQHM^ 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 5754 QDVNWDLTYPHIANEDEWSSCRYLQRMYI^ 5813 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYMH^ 364 

++FYD P + E YY HDKP DG+C+FWNCNVD YP N++VCR+DTR LS 

Sbjct: 5814 FRFYDKNPIVRNVKQFE- - -YDYNQHKDKFAIX5LCMFVWCNVDOT 5870 

Query: 365 - OTiNLPGCIX^SLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCT 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 5871 VFNLPGCNGGSLYVNKHAFYTPKFD^ 5929 

Query: 425 LKSATCITRCNLGGAVCRHHANEYR 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 5930 LATKIX^ITKCNIGGAVCKIQiAQMYAEFTO 5989 

Query: 485 SL 486 
S+ 

Sbjct: 5990 SI 5991 

>gi|458735|emb|CAA83018.l| potential chimeric protein [Avian infectious 
bronchitis virus] 

Length = 2155 

Score = 570 bits (1470) , Expect = e-161 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQWGYPNM^ 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1596 EITYKHLISLLGFKMSVNVEGCHNMFITR^ 1655 
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Query: 65 QLGFSTCJVNLVAVPTGTO 124 

Q+GPSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 
Sbjct: 1656 QVGFSTGADFVVTPEGLVDTS IGNNFEPWSKAPPGEQFNHLRVLFKSAKPWHVIRPRW 1715 

Query: 125 QMLSDTLKGLSDRVVFVLWAHGFBLTSMKYFVKIGPER 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1716 QMI^NLCWSDCWFVTWOT^ 1774 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQV^ 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 1775 KHOiGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHG 1834 

Query: 245 KRVDWSVEYPIIGDELRWSA^ 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 1835 Qm/NWDLTYPHIANEDEVNSSCRYLQRMYL^ 1894 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHH^ 364 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 1895 FRFYDKNPIVRNVKQFE YDYNQHKDKFADGLCMFWN^ 1951 

Query: 365 NLNLPGCIX^SLYVNKHAFHTPAFDKSAFTN^ 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 1952 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRNLKAMPFFFYDSS 2010 

Query: 425 LKSATC ITRCNIX5GAVCRHHANEYRQYLDAY1MMI SAGFSLWIYKQFDTYNLWNTFTRLQ' 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 2011 LATKDCITKCNIGGAVCKKHAQl^ 2070 

Query: 485 SL 486 
S+ 

Sbjct: 2071 SI 2072 



>gi| 133594 |sp|P26314|RRPB_IBVB RNA-DIRECTED RNA POLYMERASE (ORFlB) 

gi j 74826 |pir | |VFIHB2 genome polyprotein - avian infectious bronchitis 
virus {strain 



Beaudette) 



gi | 292953 
gi 331173 



gb 
gb 



AAA70234.1 
AAA46224.1 



Length = 2652 



pol protein [Avian infectious bronchitis virus] 
ORFlb [Avian infectious bronchitis virus] 



Score = 570 bits (1469), Expect = e-161 

Identities = 262/482 (54%), Positives = 344/482 (71%), Gaps = 5/482 (1%) 

Query: 5 DMTYRRLISMMGFKMNYQWGYPNMFI 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1538 EITYKHLISLIX5FKMSVNVEGCHNMFITRDEM 1597 

Query: 65 QLGFSTGWNLVAVPTGYVDTEN^ 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L FW+V+R +IV 
Sbjct: 1598 QVGFSTGADFVVTPEGLVOTSIGNNFEPVNSKAPPGEQFN^ 1657 

Query: 125 QMLSDTI^GLSDRVVFVLWAHGFELTSMKYFVKIGP 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1658 QMLADNLCWSDCWFVTWCMGLELTTLR - CGSRATTFNSHTQAYACW 1716 
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Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSN^ 244 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 1717 KHCLGFDFVYNPUiVDI(X3WGYS6NLQFNHDI^ 1776 

Query: 245 KRVDWSVEYPIIOTELRVN^ 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 1777 QDVNWDLTYPHIANEDEVNSSCRY^ 1836 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYMHHDKFTDGV^ 364 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 1837 FRFYDKNPIVRNVKQFE YDYNQHKDKFADGLCMFWNCNV^^ 1893 

Query: 365 NLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLK^ 424 

NLPGC+GGSLYVNKHAF+TP FD+ +P NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 1894 VFNLPGCNGGSLYVNKHAFYTPKFDRISFRTC 1952 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLDA 484 

Ij + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 1953 LATKDCITKCOTGGAVOCK^ 2012 

Query: 485 SL 486 
S+ 

Sbjct: 2013 SI 2014. 



>gi|29293454|gb|AAO67706.l| ORFlb polyprotein [Avian infectious bronchitis 
virus] 

Length = 2649 
Score = 565 bits (1455), Expect = e-160 

Identities = 261/482 (54%), Positives = 342/482 (70%), Gaps = 8/482 (1%>) 

Query: 5 DMTYRRLISMMGFKMNYQWGYPN^ 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 1538 EITYKHLISLLGFKMSVNVEGCHNMFITRDK^ 1597 

Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPrc 124 

Q+GFSTG + V P G +DT F VN+K PPG+QF HL L PW+V+R +IV 
Sbjct: 1598 QVGFSTGADFVVTPEGLIDTSIQWEPVNSKAPPGEQFNHLRW 1657 

Query: 125 QMLSOTLKGLSDRVVFVLWAHGFELT 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 1658 QMLADNLCNVSIXJVVFV 1716 

Query: 185 NHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVA^ 244 

H +G VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DA+MTRCLA++ F 
Sbjct: 1717 RHCLG VYNPLLVDIQQWGYSGNLQFNHDLHCNVHGHAHVASM 1773 

Query: 245 KRVDWSVEYPI IGDELRWSACRKVQHMVVKSAIJjADKFPVLHDIGNPKM 304 

K V+W ++YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 1774 KDVNWELQYPHIANEDEVNSSCRYLQRMYLN^ 1833 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYMHHDKFT^ 364 

++FYD P + E Y Y+ H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 1834 FRFYDKNPIVPNVKQFE YDYSQHKDKFADGLCMFWNCNVDCyPENSLVCRYDTRNLS 1890 
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Query: 365 NLNLPGOXMSLYVNKHAFHTPAFDKS^ 424 

NLPGC+GGSLYVNKHAFHTP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 1891 VFNLPGCNGGSLYVNIQIAFinTKFDR^ 1949 

Query: 425 LKSATCITRCNLGGAVCRHHANEYRQYLI^ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ F+ YNLW F+ LQ 
Sbjct: 1950 LATKTCITKCNIGGAVaaa^^ 2009 

Query: 485 SL 486 
S+ 

Sbjct: 2010 SI 2011 

>gi 1 25121555 1 ref |NP_740631.l| coronavirus nspll [Avian infectious 
bronchitis virus] 

Length =521 

Score = 559 bits (1440), Expect = e-158 

Identities- = 261/480 (54%) , Positives = 342/480 (71%), Gaps = 5/480 (1%) 

i 

Query: 5 IMTYRRLISMMGFKMNYQ 64 

++TY+ LIS++GFKM+ V G NMFITR+EAIR+VR W+GFDVE HA +GTNLP 
Sbjct: 47 EITYKHLISLLGFKMSVNVEGCHNMFITra 106 

Query: 65 QLGFSTGVNLVAVPTGYVDTENNTEFTO^ 124 

Q+GFSTG + V P G VDT F VN+K PPG+QF HL L PW+V+R +IV 
Sbjct: 107 QVGFSTGADFVVTPEGLVDTSIGNNra 166 

Query: 125 QMLSOTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTC 184 

QML+D L +SD WFV W HG ELT+++YFVKIG E+ C C RAT F++ + YACW 
Sbjct: 167 QMLADNLCNVSDCVVF 225 

Query: 185 NHSVGFDYVYNPFMIDVQQVTCFTGNLQSNHDQHC 244 : 

H +GFD+VYNP ++D+QQWG++GNLQ NHD HC VHG+AHVAS DAIMTRCLA++ F 
Sbjct: 226 KHCLGFDFVYNPLLVDIQQWGYSGN^ 285 

Query: 245 KRVDWSVEYPIIGDELRVNSACRKVQH^^ 304 

+ V+W + YP I +E VNS+CR +Q M + + + A K V++DIGNPK IKCV + +V 
Sbjct: 286 QDVNWDLTYPHIANEDEVNSSCRY 345 

Query: 305 WKFYDAQPCSDKAYKIEELFYSYAIHHD^^ 364. 

++FYD P + E Y Y H DKF DG+C+FWNCNVD YP N++VCR+DTR LS 
Sbjct: 346 FRFYDKNPIVRNVKQFE TOYNQHKDKFAIX3LCMFWNCNTO 402 

Query: 365 NLNLPGCDGGSLYVNKHAFOTPAFDK^ PCESHGKQWSDIDYVP 424 

NLPGC+GGSLYVNKHAF+TP FD+ +F NLK +PFF+Y SPCE+ V+ D V 
Sbjct: 403 VFNLPGCNGGSLYVNKHAFYTPKTORISF 461 

Query: 425 LKSATCITRCNLGGAVCR^^ 484 

L + CIT+CN+GGAVC+ HA Y +++ +YN ++AGF+ W+ + + YNLW +F+ LQ 
Sbjct: 462 LATIQX3ITKCNIGGAVCKKHAQMYAEFVTSYNAAW 521 



>gi | 9635157 | ref |NP_058422.l| replicase [Transmissible gastroenteritis 
virus] 

gi 1 7801348 |emb|CAB91143.l| replicase [Transmissible gastroenteritis virus] 
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Length = 6685 
Score = 545 bits (1403), Expect = e-153 

Identities = 261/484 (53%), Positives = 335/484 (69%), Gaps = 13/484 (2%) 

Query: 4 IQ3MTYRRLISMGFKMNY^ 63 

KD+ Y +IS MGF+ + GY +F TR+ A+R+VRAW+GFDVEG H D VGTN+P 
Sbjct: 5574 KDVKYANVISYMGFRFEANI 5633 

Query: 64 LQIX5FSTGVNLVAVPTCYVDTENNTEFTO 123 

LQLGFS GV+ V 6 V TE V A+ PPG+QF HLIPLM KG PW++VR +1 

Sbjct: 5634 LQIKSFSNGVDFWQTEGCVITEK^ 5693 

Query: 124 VQMLSDTLKGLSDRVVF^ 182 

VQM+ D GLSD ++FVLWA G ELT+M+YFVKIG P++ C C K ATC+S+S YA 
Sbjct: 5694 VQMVCDYFIXSLSDILIFVLWAGGLELTT^YFVKIGRPQK — CECGKSATCYSSSQSVYA 5751 

Query: 183 CTNHSVGFDYVYNPFMIE^QWGFTGNI^ 242 

• C+ H++G DY+YNP+ ID+QQWG+TG+L NH + C +H N HVAS DAIMTRCLA+H+C 
Sbjct: 5752 CFKHALGCDYLYNPYCIDIQQWGYTGSLSMNHHEVCNIHR^ 5811 

Query: 243 FVKRVDWSVEYPIIGDELRWSAOUCVQHMVVKSAI^ 302 

FVKRVDWS+ YP I +E ++N A R VQ V+K+AL +HD+GNPK I+C 

Sbjct: 5812 FVKRVDWSIVYPFIDNEEKINKAGRIVQSHVMKAAI^ 5870 

Query: 303 VEWKFYDAQPCSDKAYKIEELFYSYMHHDK^ 362 

+ W YD. P H + L Y Y +H +G+ LFWNCNVD YP +IVCRFDTR 
Sbjct: 5871 IPWFCYDRDPINN NVRCLDYDYMVHGQ — MNGLMLFWNCNVDMYPEFSIVGRFDTRT 5925 

Query: 363 LSNL^PGCDGGSLYVNIQIAFHTPAFDKSAFTNLK 422 

S L+L GC+GG+LYVN HAFHTPA+D+ AF LK +PFFYY DS CE V +Y 
Sbjct: 5926 RSKLSLEGCNCX3ALYVNNHAFOT LVDGQPNY 5981 

Query: 423 VPLKSATCITRONIX^VCRHHA 482 

VPLKS CIT+CN+GGAVC+ HA' YR Y++ YN+ + AGF++W + FDTY LW+ F 
Sbjct: 5982 VPLKSNVCITKCNIGGAVCKCTAALYRAYVEDYNIFM 6041 

Query: 483 LQSL 486 
++L 

Sbjct: 6042 SKAL 6045 



>gi 
gi 



19387582 
13752450 



ref |NP__598309.l| Poll [porcine epidemic diarrhea virus] 
gb|AAK38661.l| Poll [porcine epidemic diarrhea virus] 



Length = 6781 



Score = 541 bits (1394) , Expect = e-152 

Identities = 256/480 (53%), Positives = 334/480 (69%), Gaps = 12/480 (2%) 

Query: 8 YRRLISMMGFKMNYQWGYPNMFITREEMRHVRAW^ 67 

Y +IS MGF+ + + + +F TR+ A+R+VR W+GFDVEG H VGTN+PLQLG 
Sbjct: 5675 YEHVISFMGFRFDINIPNHHTLFCTWDFAMRNVRG 5734 

Query: 68 FSTGVNLVAVPTGYVOTENNTEF 127 

FS GV+ V P G V TE+ V A+ PPG+QF HL+PL+ +G FW+WR +IVQM 

Sbjct: 5735 FSNGVDFVVRPEGCVVTESGDYIKPVRARAPPGEQFAHLLPLLKR 5794 



A 
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Query: 128 SOTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLOT 187 

SDL LSD ++FVLWA G ELT+M+YFVKIGP ++C C K ATC++++ TY C+ H+ 
Sbjct: 5795 SDYLANLSDILIFVLWAGGLKLTTMRYFVKIGPSKSCI)- 5853 

Query: 188 VGFDYVYNPFMIDVQQWGFTGN^ 247 

+G DY+YNP+ ID+QQWG+ G+L NH +HC VH N HVAS DAIMTRCLA+H+CFVK V 
Sbjct: 5854 LGCDYLYNFYCIDIQQWGYKGSLSL 5913 

Query: 248 DWSVEYPIIGDELRWSACRKVQHMVVKSALLM 307 

DWS+ YP IG+E +N + R VQ ++S L ++DIGNPK I+C + +W 

Sbjct: 5914 DWSITYPFIGNEAVINKSGRIVQSHTMRSVLK^ 5972 

Query: 308 YDAQPCSDKAYKIEELFYSYMHHDKFTIXjVC^ 367 

+D P + +E Y Y I H +F DG+CLFWNCNVD YP ++VCRFDTR S LN 
Sbjct: 5973 FDKNPTNSNVKTLE YDY-ITHGQF-IXajCLFWNaNVDMYP 6027 

Query: 368 LPGCDGGSLYVNKHAFHTPAFDKSMTNI^ 427 

L GC+GGSLYVN HAFHTPAFDK AF LK +PFF+Y D+ C+ ++ I+YVPL++ 
Sbjct: 6028 LEGCNGGSLYVNNHAFOTPAFDKRAFAKLKPMPFFFYDDTECT KLQDSINYVPLRA 6083 

Query: 428 ATCITROJLGGAVCRHHANEYRQYL^ 486 

4- CIT+CN+GGAVC H Y- Y++AYN SAGF++W+ FDTYNLW TF+ LQ L 
Sbjct: 6084 SNCITKCOTGGAVCSKHCAMYHSYVN^ 6143 

>gi (12175747 1 ref|NP_073549.1 1 replicase polyprotein lab [Human coronavirus 
229E] 

gi|l2082740|gb|AAG48591.l|AF304460_2 replicase polyprotein lab [Human 
coronavirus 229E] : 

Length = 6758 

Score = 535 bits (1379), Expect = e-151 

Identities = 254/478 (53%), Positives = 329/478 (68%), Gaps = 13/478 (2%) 

Query: 7 TYIUUjISI^GFKMNYQWGYPNMFI 66 

TY +IS MGF+ + + G ++F TR+ A+RHVR W+G DVEG H T D VGTN+PL1Q+ 
Sbjct: 5642 TYEHVISYMGFRFDVSMPGSHSLFCTTOF 5701 

Query: 67 GFSTGVNLVAVPTGYVDTENOTEFTRVNAKPPPGI^ 126 

• GFS GV+YAPGVT + VA+ PPG+QF H++PL+ KG FW+V+R +IVQM 
Sbjct: 5702 GFSNGVDFVAQPEGCVLTNTGSV^^ 5761 

Query: 127 LSDTLKGLSDRVVFVLWAHGFELTSMKYFVKEGPERTC 186 

++D L G SD +VFVLWA G ELT+M+YFVKIG + C C ATC+++ S+ Y C+ H 
Sbjct: 5762 IADFLAGSSDVLVFVLWAGGLELTTM^^ 5820 

Query: 187 SVGFDYVYNPFMIDVQQWGFTGNLQSNHDQH^ 246 

++G DYVYNP++ID+QQWG+ G+L +NH C VH N HVAS DAIMTRCLAV++CFVK 
Sbjct: 5821 ALGCDYVYNPYVIDIQQWGWGSLSTNHHM 5880 

Query: 247 VDWSVEYPIIGDELRWSACRKVQHM^ 306 

VDWS+ YP+I +E +N R VQ ++++A+ +HDIGNPK I+C + +W 

Sbjct: 5881 VDWSITYPMIANENAINKGGRTVQS 5939 

Query: 307 FYDAQPCSDKAYIQIEELFYSYAIHHDKFTIXjVC^^ 366 
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YD P + +E Y Y H DG+CLFWNCNVD YP +IVCRFDTR S L 
Sbjct: 5940 CYDKNPINSNVKTLE YDYMTHGQ — MDGLCLFWNCNVDMYPEFSIVCRFDT^^ 5994 

Query: 367 NLPGCDGGSLYVNKHAFHTPAFDKSAPTMiKQ 425 

NL G +GGSLYVN HAFHTPA+DK A LK PFFYY D CE WD ++YVPL 
Sbjct: 5995 NLEGVNGGSLYVNNHAFHTPAYDIQlAMAKIiKPAP WHDQVNYVPL 6049 

Query: 426 KSATCITRCNLGGAVCRffl 483 

++ CIT+CN+GGAVC HAN YR Y+++YN+ AGF++W+ FD YNLW TFT + 
Sbjct: 6050 RATNCITKCOTGGAVCSKHANL^ 6107 

>gi 133591 1 sp Pl8458|RRPB_BEV RNA-directed RNA polymerase (ORF1B) 
gi 94017 |pir |S11238 polymerase - Berne virus 

gi 1334814 |emb|CAA36601.l| 2nd polymerase reading frame (AA 1-2291) [Berne 
virus] 

Length = 2291 
Score = 50.1 bits (118), Expect = 8e-05 

Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 11/103 (10%) 

Query: 140 . FVLVJAHGFELTSMKYFVKIGPERTC — CLCDKRATCFSTSSDTYACWNHSVGF — DYVYN 195 

F+L++ +L S+K++V+ TCCC+AC +YCN G +. N 
Sbjct: 1511 FILYSCSNDLKSLKFYVEFD TCYFCSCGEMAIC^MRDGN-YKC^CTGGMLISKLVN 1566 

Query: 196 PFMIDVQQWGFTGMLQSNHDQHC-QWGNAHVASOT 237 

+DVQ+ LQ HD C Q HG++H A CDA+MT+CL 

•Sbjct: 1567 CKYLDVQKERV — KLQDAHDAICQQFHGDSHEALCDAVMTKCL 1607 

>gi|l51306l|dbj |BAAl3323.l| cyanoprotein alpha subunit precursor [Riptortus 
clavatus] 

Length = 693 
Score = 34.7 bits (78), Expect = 3.7 

Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 1/36 (2%) 

Query: 371 CDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSD 406 

C G LY +KHA P FD+ A+ + Q+P FY+ D 
Sbjct: 643 CGGSKLYDSKHAMGFP-FDRPAYPDAFQVPNFYFKD 677 



Database: All non- redundant GenBank CDS 
trans la tions +PDB+Swi s s Prot+PIR+PRF 

Posted date: Apr 11, 2003 2:30 AM 
Number of letters in database: 454,141,287 
Number of sequences in database: 1,411,415 

Lambda R H 

0.325 0.139 0.456 



Gapped 

Lambda K H 

0.267 0.0410 0.140 
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Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 473,361,261 

Number of Sequences: 1411415 

Number of extensions: 20503315 

Number of successful extensions: 51018 

Number of sequences better than 10.0: 27 

Number of HSP's better than 10.0 without gapping: 26 

Number of HSP's successfully gapped in prelim test: 1 

Number of HSP's that attempted gapping in prelim test: 50937 

Number of HSP's gapped (non-prelim): 33 

length of query: 486 

length of database: 454,141,287 

effective HSP length: 127 

effective length of query: 359 

effective length of database: 274,891,582 

effective search space: 98686077938 

effective search space used: 98686077938 

T: 11 

A: 40 

XI: 15 ( 7.0 bits) 
X2: 38 (14.6 bits) 
X3: 64 (24.7 bits) 
SI: .40 (21.6. bits) 
S2: 75 (33.5 bits) 
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FIGURE 125A 



4- 
3- 
2 
1- 
0 

-1 

-2 

-3H 



I i l i I • 1 i l i l i I i I i l i l i l i l i I i I i I i l i I i I i I i l i I i l i l i I % 

0 26 40 60 80 101 120 140 161 180 200 220 240 260 280 300 320 340 360 380 400 420 440 460 



FIGURE 125B 
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FIGURE 126 

5'3' Frame 1 

QVHQNVCVL-LIFYLMTLSR — SHKICQ-FQKWSRLQLTMLKFHSCFGVPMDMLKPSTQN 

YKQVKRGNQVLRCLTCTRCKECFLKSVTFRIMVKMLLYQKE MSQSILNCVNT-IHLL 

-LYPPT-ELFTLVL 

5'3* Frame 2 

RFIKMCVFCD-SFT — LCRDNKVTRFVSDFXSGQGYN-LC-NFIHALV-GWTC-NLLPKT 
TSKSSVATRCCDA-LVQDAKNAS-KV-PSELW-KCCYTKRNM)ECRKVYSWSILm , YF 
SCTLQHESYSLWCW 

5 • 3 • Frame 3 

GSSKWCSVIDLLLDDFVEIIKSQD 

QASQAWQPGVAMPNLYKMQRMLLEKCDLQ]^^ 

AVPSNMRVIHFGAG 

3'5' Frame 1 

PAPK-ITLMLEGTAKVSVFKY-HS-VYFATFIIIPFGITAFSP-F-RSHFSRSILOILYK 
LGIATPGCHA-MCSFX3-KVSTCPSLHQSMNEISA-SIVTLTTFEITDKSCDFIISTKSS 
SKRSITEHTHFDEP 

3'5' Frame 2 

QHQSE-LSCWRVQLK-raSIDTVEYTLRHSSLFLLV-QHFHHNSEGHTFQFAFFASCTS 
-ASQHLVATLDLLVVLGRRFQHVHPYm-MKFQHSQL-P-PLLKSLTNLVTLLSRQSHQ 
VKDQSQNTHILMNL 

3 ' 5 ' Frame 3 

STKVMSHVGGYS-SKCI-VLTQLSILCDIHHYSFWYNSIFTIILKVTLFKKHSLHLVQV 

RHRNTWLPRLTCL-FWVEGFMSILTPKHE-^ 

-KINHRTHTF-T 
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FIGURE 127 

5'3' Frame 1 

-VFTYP6KANQPRSLVDLFSKRTN-NV — WTPIKPT-CPPHYIWWTHRFN-Q-PEWRTAM 

GQGQNSADPKVYPI ILRLGSQLSLSMARI^jDSLEARAFQSTPIWQMTKLiATTEELPDE 

FVWTM-KSSAPDGTSIT-ELAQKLHFPT^ 

MLPPCYNFLKEQHCQKASTQREAEAAVKP^ 

VVKLPSRYCC-TD-TSLRAKFLVKA1MJKMLS 

LGDWQNKPKEISGTKT-SDKELITNIGPQIAQFA 

5>3> Frame 2 

RFLPTQEKPTNLDLL-ICSLNEQIKMSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGLQW 
GKAKTAPTPRFTQ-YCVLVHSSHSAWQGGT-IPSRPGRSNQHQ-WSR-PNWLLPKSYPTS 
SWW-RQNERAQPQMVLLLPRNWPRSFTS^ 

CCHRATTSSRNNIAKRLLRRGKQRRQSSLFSLLIT-SR-FKKFNSWQQ-GKFSCSNG-RR 
W-NCPRMMRQIEPA-EQSFW-RPTTTRPNCH-EICC-GI-KASPKTYCMTVQRHSSI 
WETWSRTNPRKFRGPRPNOTRN-LQTLGRKLHNLP 

5'3' Frame 3 

GFYLPPJCSQPTSISCRSVL-TNKLKCLIMDPNQTNVVPPALHLVDPQIQLTITRM 

MPKQRRPQGLPMIASWFTALTQHGKEELRFPRGQGWINTNSGPDDQIGYYRRATRRV 

RGGIX3KMKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEGA 

MTVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGG 

GETALALLLJ^RLNQLESKVSGKGQQQQGOWTKKSAAEASKKPRQKR 

GRRGPEQTQGNFGDQDLIRQGTDYKHWAANCTIC 

3' 5* Frame 1 

RQIVQFMQCL-SWCLIRSWSPKFPWCSGPRLPNA-VTLYCFVAVRFWRGFLDASAAD 
FLVTVWPCCCWPLPETLLSSWFNLSSSNSARAVSPPPLAIRAGEFPLLLPGVEFLELPRL 
RDEEREEA-LPPLLPSA-KPFGNVVP-GSCSWAALLLGLRVPMSLGVFKAPSVATHTM 
PSLLAP-GSEASGPVPR — KYHLGLSSFILPSPPRTRRVALR — PIWSSGPLLVLIGTPW 
PRGOTiSSSLPC-VRAWQDMLLGKPWGRRCFGLAPLQSSILVIVS-ICGSTKCNAGGTT 
LV-LGSIIRHFNLFV-RTDLQEIEVGWLFLGR-KP 

3' 5' Frame 2 

GKLCmRPWCNQFLV-LGLGPRNFLGmDHVSQMLE-RCmWQYVFGEAF-MPQQQI 
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S — QFGLVWGLYQKLCSQAGSICLAAIARGQFHHLR-PFEQE^ 

VMRSEKRLTCRLCFPLRRSLLAMLFLEEWARWQHCT 

LLC-RRREVKLLGQFLGNRSTIWG-ALSFCRHHHELVG-L 

LEGI-WPCHAE-EL-TKTQYYWVNLGVGAVLALPHCSPPFWLLSVESVGPPN^ 

WFDWGPLSDILICSFREQIYKRSRLVGFSWGKNL 

3'5' Frame 3 

MCMCGPMFVISSLSD-VLVPEISLGLFWTTSPKCLSDWLFCGSTFLARLFRCLSSRF 

LSDSI^LLLAFTRNFAIJa.VQSV-QQ-REGSFTTSASHSSmSPTAARS-IS-ITATT 

— GARRGLTAASASLCVEAFWC€CSLRKL-HGGSI^ 

FWSAVGK-SFWASS-VIEVPSGAELFHFAVTT^ 

SRESKFLLAMLSESCEPRimiG-TLGSALFWPCPIAVLHSGYCQI^ 

GLIGVHY QTF - FVRLENRSTRDRGWLAFPG-VKT 



FIGURE 128 

-GLELKL-LTSICAF-PFCYSLF — CLLYFGFHSKSRI-KNLVPKSKRT-NFSLF-LVFL 
YAVAYAL-YSAVHLINMCLKILVRYM , RGNTYSTAVttCAIX3KVLPFH^ 
WINCQDPAGGALIMCWYLHEGHQTAAFRDVLVVLNKRTN-NV- -WTPIKPT-CPPHYI 
l^HRFW-Q-.PmTQWGKAKTAPTPRFTQ-YC^VHSSHSAWG<3GT-iPSRPGRSNQHQ 
; -WSR-PNV^LPKSYPTS^-RQNERAQPQMVLLLPRIWPRSFTSLRR-QRRHRMG€N-G 
SLEYTQRPHWHPQS--QCCHRATTSSRMIAKRLIJUIGKQRRQSSLFSLLIT-SR-FKKF •• 
NSWQQ-GKFSCSNG-RRW-NCPRAIAMQIEPA-EQSFW-RPTTTRPNCH-EICC-GI-K 
ASPKTYCHKWQRHSSIWETWSRTNPRKFRGPRPNO^^ 

WHMGSHTFGMADLSWSH-IG-QRSTIQRQRHTMQM-RIQNIPTNRA-KGQKEKD- 
-SSAFAAETKEAAHCDSSSC 

EDSSSSFN-LLFVLFSLSAIPCFNNAYYILVFTRNPGSRRTLYQSLNEHETSHCFDLYFS 
MQLHMHCSTALCI — TSCA-RSL-GTTLGVILIALLGFVL-ERFnjFIIX3TLWFKHAHM 
LLSTVKIQLVm-LGVGTFMKVmLHLETYLLF-INEQIKMSDNGPQSNQRSAPRITF 
GGPTDSTDMQNGGRNGAP^KQRRPQGLPNmSWFTALTQHGKEELRFPRGO^INra 
SGPDDQIGYYmTRRWGGDGKMKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEG 
ALNTPKDHIGTRNPNNNAAmQL 

TPGSSRGNSPARMASGGGFJAIALLLLDRLNQLESKVSGKGQQQO^OIWKKSAAEAS^ 

PRQKRTATKQYWTQAFGRRGPEOT^GNFGDQDLIRG^TDYKHWPQIAQFAPSASAFFGM 

SRIGMEWPSGTWLTYHGAIKLDDKDPQFKDl^ 

EAQPLPQRQKKQPTVTLLP 
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RTRAQALIDFYLCFLAFLLFLVLIMLIIFWFSLEIQ 

CSCICTWQRCASNKPHVLEDPCKVQH-G-YL-HCLALCSRKGFTFS-MAHYGSNMHT-C 
YYQLSRSSWWCAYS-VLVPS-RSPNCCI-RR 

VDPQIQLTITRMEDMGQGQNSADPKVYPII^GSQLSLSMARRNLDSLEARAFQSTPI 
WQMTKLATTEELPDEFV\An)M-KSSAPDGTSIT-ELAQKIiHFPT 
P-IHPKTTLAPMLITMLPPC!fNFLKEQHC 
LLMVGEILLLEWLAEVVKLPSRYCC^ 

LAKNVLPQNSTTSLKHLGDWQNKPKEISGTKT-SDKELITNIGRKL^^ 

HALAWKSHLREHG-LIMEPLNWMmMSKTT^ 

KLSLCRRDKRSSPL-LFFL 



FIGURE 129 

5'3' Frame 1 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccct 

YRRLI SMMGFKMNYQVNGYP 
aatatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgat 

NMFITREEAIRHVRAWIGFD 
gtagagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctagga 

V EGCHATRDAVG TN LPLQLG 
ttttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataac 

F S T G VN LVAV P T G YV D T E N N 
acagaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

TEFTRVNAKPPPGDQFKHLI 



5<3> Frame 2 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttacccta 

TVDSSL-WVSK-ITKSMVTL 
atatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatg 

ICLSPAKKLFVTFVRGLALM 
tagagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctaggat 

-RAVMQLEMLWVLTYLSS-D 
tttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataaca 

FLQVLT - - LYRLVMLTLKIT 
cagaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

QNSPELMQNLHQVTSLNILY 
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5<3< Frame 3 

taccgtagactcatctctatgatgggtttcaaaatgaattaccaagtcaatggttaccctaa 

P-THLYDGFQNEL P S Q W L P - 
tatgtttatcacccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatgt 

YVYHPRRSYSSRSCVDWL-C 
agagggctgtcatgcaactagagatgctgtgggtactaacctacctctccagctaggatt 

R6LSCN-RCC6Y - PTSPARI 
ttctacaggtgttaacttagtagctgtaccgactggttatgttgacactgaaaataacac 

FYRC-LSSCTDWLC-H-K-H 
agaattcaccagagttaatgcaaaacctccaccaggtgaccagtttaaacatcttatacc 

RIHQS-CKTSTR-PV-TSYT 



3'5' Frame 1 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattct 

G.IRCLNWSPGGGFALTLVNS 
gtgttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaa 

VLFSVST-PVGTATKLTPVE 
aatcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctct 

N P S .W R G R L V P T A S L V A - Q P S 
acatcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacata 

TSKPIHART-RIASSRVINI 
ttagggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

LG-PLTW-F ILKPIIEMSLR 



3'5' Frame 2 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctg 

V-DV-TGHLVEVLH-LW-IL 
tgttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaa 

C Y F QCQHNQSVQLLS-HL-K 
atcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctcta 

ILAGEVG-YPQHL-LHDSPL 
catcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatat 

HQSQSTHERDE-LLRG--TY 
tagggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

-GNH-LGNSF-NPS-R-VYG 
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3'5' Frame 3 

ggtataagatgtttaaactggtcacctggtggaggttttgcattaactctggtgaattctgt 

YKMFKLVTWWRFCINSGEFC 
gttattttcagtgtcaacataaccagtcggtacagctactaagttaacacctgtagaaaa 

VIFSVNITSRYSY.-VNTCRK 
tcctagctggagaggtaggttagtacccacagcatctctagttgcatgacagccctctac 

S-LER-VSTHSISSCMTALY 
atcaaagccaatccacgcacgaacgtgacgaatagcttcttcgcgggtgataaacatatt 

IKANPRTNVTNSFFAGDKHI 
agggtaaccattgacttggtaattcattttgaaacccatcatagagatgagtctacggta 

RVTIDLVIHFETHHRDESTV 

FIGURE 130 

10 20 30 40 50 60 

II I I.I I. 

SEQ ID NO: 9997 KGHDIiRRLISMGFKMNYQWGY^ 

SEQ ID NO: 10034 YRRLI SMMGFKMNYQVNGYPNMF ITREEAIRHVRAWIGFDVEGCHATRDAVGTNLP 

************ ********** **** *********** ****************** 

Prim. Cons. " KGHD2miSMGFKMYQWGYPNMFITREMIRHVRAWIGFDVEGCHATRDAVGTNLP 

70 80 90 100 110 120 

I I I I I I 

SEQ ID NO: 9997 I^LGFSTGmVAVPTGYVDTENNTK^ 

SEQ ID NO: 10034 LQLGFSTGVNLVAVFTGYVDTENOT^ 

*************************.******. .****** 

Prim. cons. LQLGFSTGVNLVAVPTGYVDTENNT2FTRVNA222222QP^^ 

130 140 150 160 170 180 

I I I III 

SEQ ID NO: 9997 VQMLSDTLKGLSDRVVFVLV^GFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYAC 

SEQ ID NO: 10034 

Prim. cons. VQMLSDTLKGLSDRWFVLWAHGFELTSMKYFVKI^^ 

190 200 

I I 
SEQ ID NO: 9997 WNHSVGFDYVYNPFMIDVQQWGLYG 

SEQ ID NO: 10034 
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FIGURE 131 

5«3» Frame 1 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgaga 

QVHQNVCVL-LIPYLMTLSR 
taataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatg 

--SHKICQ-FQKWSRLQLTM 
ctgaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaac 

LKFHSCFGVRMDMLKPSTQN 
tacaagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaa 

YKQVKRGNQVLRCLTCTRCK 
gaatgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaag 

ECFLKSVTFRIMVKMLLYQK 
gaataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttactt 

E---MSQSILNCVNT-IHLI1 
tagctgtaccctccaacatgagagttattcactttggtgctgg 

-LYPPT-ELFTLVL 



5'3' Frame 2 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgagat 

RFIKMCVFCD-SFT- - LCRD 
aataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatgc 

NKVT RFVSDF KSGQGYN-LC 
tgaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaact 

- N P I H A L V - GWT C - N LL PKT 
acaagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaag 

TSKS SVATRCCDA-LVQDAK 
aatgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaagg 

NAS-KV-PSELW-KCCYTKR 
aataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttacttt 

NNDECRKV YSTVSILKYTYF 
agctgtaccctccaacatgagagttattcactttggtgctgg 

SCTLQHESYSLWCW 



5'3 f Frame 3 

caggttcatcaaaatgtgtgtgttctgtgattgatcttttacttgatgactttgtcgagata 

GSSKCVCSVIDLLLDDFVEI 
ataaagtcacaagatttgtcagtgatttcaaaagtggtcaaggttacaattgactatgct 

IKSQDLSVISKVVKVTIDYA 
gaaatttcattcatgctttggtgtaaggatggacatgttgaaaccttctacccaaaacta 

EISFMLWCKDGHVETFYPKL 
caagcaagtcaagcgtggcaaccaggtgttgcgatgcctaacttgtacaagatgcaaaga 
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QASQAWQPGVAMPNLYKMQR 
atgcttcttgaaaagtgtgaccttcagaattatggtgaaaatgctgttataccaaaagga 

MLLBKCDLQNYGENAVIPK6 
ataatgatgaatgtcgcaaagtatactcaactgtgtcaatacttaaatacacttacttta 

IMMNVAKYTQLCQYLNTLTL 
gctgtaccctccaacatgagagttattcactttggtgctgg 

AVPSNMRVIHFGA 



3 ' 5 ' Frame 1 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaag 

PAPK - ITLMLEGTAKVSVFK 
tattgacacagttgagtatactttgcgacattcatcattattccttttggtataacagca 

Y-HS-VYFATFI IIPFGITA 
ttttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaag 

FSP-F-RSHFSRSILCILYK 
ttaggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggtt 

LGIATPGCHA-LACSFG-KV 
tcaacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacc 

STCPSL HQSMN EISA-S IVT 
ttgaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatca 

LTTFEITDKSCDFIISTKSS 
agtaaaagatcaatcacagaacacacacattttgatgaacctg 

SKRSITEHTHFDEP 

3>5> Frame 2 

ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaagt 

QHQSE-LSCWRVQLK-VYLS 
attgacacagttgagtatactttgcgacattcatcattattccttttggtataacagcat 

IDTVEYTLRHSSLFLLV-QH 
tttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaagt 

FHHNSEGHTFQEAFFASCTS 
taggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggttt 

-ASQHLVATLDLLVVLGRRF 
caacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacct 

QHVHPYTKA-MKFQHSQL-P 
tgaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatcaa 

-PLLKSLTNLVTLLSRQSHQ 
gtaaaagatcaatcacagaacacacacattttgatgaacctg 

VKDQSQNTHILMNL 



3'5' Frame 3 
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ccagcaccaaagtgaataactctcatgttggagggtacagctaaagtaagtgtatttaagta 
STKVNNSHVGGYS-SKCI-V 
ttgacacagttgagtatactttgcgacattcatcattattccttttggtataacagcatt 

LTQLSILCDIHHYSFWYNSI 
ttcaccataattctgaaggtcacacttttcaagaagcattctttgcatcttgtacaagtt 

FTIILKVTLFKKHSLHLVQV 
aggcatcgcaacacctggttgccacgcttgacttgcttgtagttttgggtagaaggtttc 

R H R N T W L PRLTCL-FWVEGF 
aacatgtccatccttacaccaaagcatgaatgaaatttcagcatagtcaattgtaacctt 

NMSILTPKHE-NFSIVNCNL 
gaccacttttgaaatcactgacaaatcttgtgactttattatctcgacaaagtcatcaag 

DHF-NH-QIL-LYYLDKVIK 
taaaagatcaatcacagaacacacacattttgatgaacctg 

-KINHRTHTF- - T 



FIGURE 132 

S^' Frame 1 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctct 

-VFTYPGKA NQPRSLVDLFS 
aaacgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccg 

KRTN-NV--WTPIKPT-CPP 
cattacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatg 

HY IWWTHRFN - Q - PEWRTAM 
gggcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttca 

GQGQNSADPKVYPIILRLGS 
cagctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaa 

QLSLSMARRNLDSLEARAFQ 
tcaacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgag 

STPIVVQMTKLATTEELPDE 
ttcgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctag 

FVVVTAK-KSSAPDGTSIT- 
gaactggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttg 

ELAQKLHFPTALTKKASYGL 
caactgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataaca 

QLREP-IHPKTTLAPAILIT 
atgctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcag 

MLPPCYNFLKEQHCQKASTQ 
agggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaatt 

REAEAAVKPLLAPHHVVAVI 
caagaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggag 

QEIQLLAAVGEILLLEWLAE 
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gtggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaag 

VVKLPSRYCC-TD-TSLRAK 
tttctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgagg 

FLVKANNNKAKLSLRNLLLR 
catctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcat 

HLKSLAKNVLPQNSTTSLKH 
ttgggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagac 

LGDVVQNKPKEISGTKT-SD 
aaggaactgattacaaacattgggccgcaaattgcacaatttgcct 

KELITNIGPQIAQFA 



5»3» Frame 2 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctcta 

RFLPTQEKPTNLDLL-ICSL 
aacgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccgc 

NEQIKMSDNGPQSNQRSAPR 
attacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatgg 

ITFGGPTDSTDNNQNGGLQW 
ggcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttcac 

GKAKTAPTPRFTQ-YCVLVH 
agctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaat 

SSHSAWQG G T - IPSRPGR S.N 
caacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgagt 

Q H . Q - WSR-PNWLLPKSYPTS 
tcgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctagg 

SWW-RQNERAQPQMVLLLPR 
aactggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttgc 

NWPRSFTSLRR-QRRHRMGC 
aactgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataacaa 

N- GSLEYTQRPHWHPQS- - Q 
tgctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcaga 

CCHRATTSSRNNIAKRLLRR 
gggaagcagaggcggcagtcaagcctcttctcgctcctcatcacgtagtcgcggtaattc 

GKQRRQSSLFSLLIT - SR-F 
aagaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggagg 

KKFNS WQQ-GKFSCSNG-RR 
tggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaagt 

W-NCPRAIAARQIEPA-EQS 
ttctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgaggc 

FW-RPTTTRPNCH-EICC-G 
atctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcatt 

I-KASPKTYCHKTVQRHSSI 
tgggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagaca 

WETWSRTNPRKFRGPRPNQT 
aggaactgattacaaacattgggccgcaaattgcacaatttgcct 

RN-LQTLGRKLHNLP 
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5*3' Frame 3 

taggtttttacctacccaggaaaagccaaccaacctcgatctcttgtagatctgttctctaa 

GFYLPRKSQPTSISCRSVL- 
acgaacaaattaaaatgtctgataatggaccccaatcaaaccaacgtagtgccccccgca 

TNKLKCLIMDPNQTNVVPPA 
ttacatttggtggacccacagattcaactgacaataaccagaatggaggactgcaatggg 

LHLVDPQIQLTITRMEDCN G 
gcaaggccaaaacagcgccgaccccaaggtttacccaataatattgcgtcttggttcaca 

ARPKQRRPQGLPNNIASWFT 
gctctcactcagcatggcaaggaggaacttagattccctcgaggccagggcgttccaatc 

ALTQHGKEELRFPRGQGVPI 
aacaccaatagtggtccagatgaccaaattggctactaccgaagagctacccgacgagtt 

NTNSG. PDDQIGYYRRATRRV 
cgtggtggtgacggcaaaatgaaagagctcagccccagatggtacttctattacctagga 

RGGDG KMKELSPRWYFYYLG 
actggcccagaagcttcacttccctacggcgctaacaaagaaggcatcgtatgggttgca 

TG PEAS L PYGANKEGIVWVA 
actgagggagccttgaatacacccaaagaccacattggcacccgcaatcctaataacaat 

TEGALNTPKDHIGTRNPNNN 
gctgccaccgtgctacaacttcctcaaggaacaacattgccaaaaggcttctacgcagag 

AATVLQLPQGTT- LP:KGFYAE 
ggaagcagaggcggcagtcaagcctcttdtcgctcctcatcacgtagtcgcggtaattca 

GSRGGSQASSRSSSRSRGNS 
agaaattcaactcctggcagcagtaggggaaattctcctgctcgaatggctagcggaggt 

RNSTPGSSRGNSPARMASG.G 
ggtgaaactgccctcgcgctattgctgctagacagattgaaccagcttgagagcaaagtt 

GETALALLLLDRLNQLESKV 
tctggtaaaggccaacaacaacaaggccaaactgtcactaagaaatctgctgctgaggca 

SGKGQQQQ G Q TVTKKSAAEA 
tctaaaaagcctcgccaaaaacgtactgccacaaaacagtacaacgtcactcaagcattt 

SKKPRQKRTATKQYNVTQAF 
gggagacgtggtccagaacaaacccaaggaaatttcggggaccaagacctaatcagacaa 

GRRGPEQTQGNFGDQDLIRQ 
ggaactgattacaaacattgggccgcaaattgcacaatttgcct 

GTDYKHWAANCTIC 



3*5* Frame 1 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtct 
RQIVQFAAQCL-SVPCLIRS 

tggtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacg 
WSPKFPWVCSGPRLPNA-VT 

ttgtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagat 
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LYCFVAVRFWRGFLDASA AD 
ttcttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagc 

FLVTVWPCCCWPLPETLLSS 
tggttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccatt 

WFNLSSSNSARAVSPPPLAI 
cgagcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgacta 

RAGEFPLLLPGVEFLELPRL 
cgtgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagcct 

RDEEREEA-LPPLLPSA-KP 
tttggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattg 

FGNVVP - GSCSTVAALLLGL 
cgggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatg 

RVPMWSLGVFKAPSVATHTM 
ccttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaag 

PSLLAP-GSEASGPVPR--K 
taccatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagct 

YHLGLSSFILPSPPRTRRVA 
cttcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctgg 

LR - - PIWS S.GPL LVL IGTPW 
cctcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgca 

PRGNLSSSfcPC-VRAVNQDA 
atattattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctcc 

ILLGKPWGRRCFGLAPLQSS 
attctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacg 

ILVIVS-ICGS T K C N A G G T T 
ttggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatcta 

LV-LGSIIRHFNLFV-RTDL 
caagagatcgaggttggttggcttttcctgggtaggtaaaaaccta 

Q EIEVGWLFLGR-K.P 



3'5' Frame 2 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtctt 
GKLCNLRPNVCNQFLV - LGL 
ggtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacgt 

GPRNFLGFVLDHVSQMLE-R 
tgtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagatt 

CTVLWQYVFGEAF-MPQQQI 
tcttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagct 

S - - QFGLVVVGLYQKLCSQA 
ggttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccattc 

GSICLAAIARGQFHHLR-PF 
gagcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgactac 

EQENFPYCCQELNFLNYRDY 
gtgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagcctt 

VMRSEKRLDCRLCFPLRRSL 
ttggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattgc 
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LAMLFLEEVVARWQHCY-DC 
gggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatgc 

GCQCGLWVYS RLPQLQPIRC 
cttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaagt 

LLC-RRREVKLLGQFL6NRS 
accatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagctc 

TIWG-ALSFCRHHHELVG-L 
ttcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctggc 

PGSSQFGHLDHYWC-LERPG 
ctcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgcaa 

LEGI-VPPCHAE-EL-TKTQ 
tattattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctcca 

YYWVNLGVGAVLALPHCSPP 
ttctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacgt 

FWLLSVESVGPPNVMRGALR 
tggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatctac 

WFDWGP LSDILICSFREQIY 
aagagatcgaggttggttggcttttcctgggtaggtaaaaaccta 

K. R SRLVGFSWVGKNL 



3*5* Frame 3 

aggcaaattgtgcaatttgcggcccaatgtttgtaatcagttccttgtctgattaggtcttg 

A'NCAICGPMFV ISSLSD-YL 
gtccccgaaatttccttgggtttgttctggaccacgtctcccaaatgcttgagtgacgtt 

VPEISLGLFWTTSPKCLSDV 
gtactgttttgtggcagtacgtttttggcgaggctttttagatgcctcagcagcagattt 

VLFCGSTFLARL FRCL SSRF 
cttagtgacagtttggccttgttgttgttggcctttaccagaaactttgctctcaagctg 

LSDSLALLLLAFTRNFALKL 
gttcaatctgtctagcagcaatagcgcgagggcagtttcaccacctccgctagccattcg 

VQSV-QQ-REGSFTTSASHS 
agcaggagaatttcccctactgctgccaggagttgaatttcttgaattaccgcgactacg 

SRRISPTAARS-IS-ITATT 
tgatgaggagcgagaagaggcttgactgccgcctctgcttccctctgcgtagaagccttt 

- - GARRGLTAASASLCVEAF 
tggcaatgttgttccttgaggaagttgtagcacggtggcagcattgttattaggattgcg 

WQCCSLRKL-HGGSIVIRIA 
ggtgccaatgtggtctttgggtgtattcaaggctccctcagttgcaacccatacgatgcc 

GANVVFGCIQGSLSCNPYDA 
ttctttgttagcgccgtagggaagtgaagcttctgggccagttcctaggtaatagaagta 

FFVSAVGK-SFWASS-VIEV 
ccatctggggctgagctctttcattttgccgtcaccaccacgaactcgtcgggtagctct 

PSGAELFHFAVTTTNSSGSS 
tcggtagtagccaatttggtcatctggaccactattggtgttgattggaacgccctggcc 

SVVANLVIWTTIGVDWNALA 
tcgagggaatctaagttcctccttgccatgctgagtgagagctgtgaaccaagacgcaat 
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SRESKFLLAMLSESCEPRRN 
attattgggtaaaccttggggtcggcgctgttttggccttgccccattgcagtcctccat 

IIG-TLGSALFWPCPIAVLH 
tctggttattgtcagttgaatctgtgggtccaccaaatgtaatgcggggggcactacgtt 

SGYCQLNLWVHQM-CGGHYV 
ggtttgattggggtccattatcagacattttaatttgttcgtttagagaacagatctaca 

GLIGVHYQTF-FVRLENRST 
agagatcgaggttggttggcttttcctgggtaggtaaaaaccta 

RDRGWLAFPG-VKT 
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FIGURE 134 
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